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Neural Thread Protein Gene Expression and 
Detection of Alzheimer's Disease 
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Field of the Invention 



The present invention is in the field of genetic engineering and 
molecular biology. This invention is directed to recombinant hosts expressing 
novel proteins associated with Alzheimer's Disease, neuroectodermal tumors, 
malignant astrocytomas, and glioblastomas. This invention is specifically 
directed to the recombinant hosts and vectors which contain the genes coding 
for the neuronal thread proteins. This invention is also directed to 
substantially pure neural thread proteins, immunodiagnostic and molecular 
diagnostic methods to detect the presence of neural thread proteins, and the 
use of nucleic acid sequences which code for neural thread proteins in gene 
therapy. 
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Background of the Invention 

Alzheimer's Disease 

Alzheimer's Disease (AD) is the most frequent cause of dementia in 
the United States, affecting over two million individuals each year. It is a 

5 degenerative brain disorder characterized clinically by loss of memory, 

confusion, and gradual physical deterioration. It is the fourth most common 
cause of death. The etiology of the disease is virtually unknown but has been 
attributed to various viruses, toxins, heavy metals, as well as genetic defects. 
The disease is at present incurable. 

10 Until quite recently, AD was thought to account for relatively few of 

the cases generally classified as senile dementia. Other factors can lead to 
such a condition, including repetitious mild strokes, thyroid disorders, 
alcoholism, and deficiencies of certain vitamins, many of which are potentially 
treatable. It can be appreciated, then, that a diagnostic test specific for AD 

15 would be very useful for the clinical diagnosis and proper clinical treatment 

of subjects presenting with symptoms common to all of these conditions. 

The brains of individuals with AD exhibit characteristic pathological 
accumulations of congophilic fibrous material which occurs as neurofibrillary 
tangles within neuronal cell bodies, and neuritic (or senile) plaques. 

20 Neurofibrillary tangles may also be found in the walls of certain cerebral 

blood vessels. The major organized structural components of neurofibrillary 
tangles are paired helical filaments. Qualitatively indistinguishable amyloid 
deposits also occur in normal aged brains but in much smaller numbers with 
restricted topographical distribution. 

25 There has been considerable recent investigative activity regarding the 

characterization of proteins found in neuritic plaques and neurofibrillary 
tangles of AD and other neurologic diseases. One of the amyloid proteins 
initially described by Glenner et al. has been cloned and sequenced (Glenner 
etal, Biochenu Biophys. Res. Commun. 720:1131-1135 (1984); U.S. Patent 
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No. 4,666,829). The A4 amyloid protein found in neuritic plaques and blood 
vessels has been determined to be a component of a 695 amino acid precursor; 
a protein postulated to function as a glycosylated cell surface receptor 
(Masters etaL, Proc. Nail. Acad. ScL USA 52:4245-4249 (1985), Kang etaL, 
5 Nature 325:733-736 (1987)). In addition, the amyloid protein has been 

postulated to function as a cell adhesion molecule and as a calcium ion 
channel protein (Hooper, /. NIH Res. 4: 48-54 (1992); Rensberger, Wayward 
Protein Molecule May Be Elusive Killer of Brain Cells, The Washington Post, 
January 25, 1993, §1, at A3 (1993)). The gene coding for A4 is located on 

10 chromosome 21 (Kang et aL, ibid.; Goldgaber et al„ Science 255:877-880 

(1987); Tanzi et aL, Science 235:880-885 (1987); St. George-Hyslop et aL, 
Science 235:885-889 (1987)) but apparently is not linked to the familial form 
of the disease (Van Broekhoven et aL. Nature 329:153-155 (1987)). There 
appears to be little, if any, protein sequence homology between amyloid A4 

15 and fl protein, their higher molecular weight precursor, and pancreatic thread 

protein (PIP) (Gross etal., J. Clin. Invest. 76:2115-2126 (1985)). 

A number of other proteins thought to be associated with the disease 
have been described, including Ubiquitin, ALZ-50, microtubular-associated 
proteins r and MAP2, and neurofilament protein (see, for example, Manetto 

20 et aL, Proc. NatL Acad. Sci. USA 55:4502-4505 (1988); Wolozin et aL, 

Science 232:648-651 (1986); Selkoe, NeurobioL Aging 7:425-432 (1986); 
Perry et aL, in: Alterations of the Neuronal Cytoskeleton in Alzheimer's 
Disease, Plenum, New York, pp 137-149 (1987)). More recently, a serine 
protease inhibitor called a r anti-chymotrypsin has been found in AD amyloid 

25 deposits (Abraham et al., Cell 52:487-501 (1988)). 

There is currently no useful diagnostic test for AD being practiced 
clinically. A definitive diagnosis is possible only postmortem, or during life 
through a brain biopsy, to reveal the presence of the characteristic plaques, 
tangles, paired helical filaments, and other cerebrovascular deposits which 

30 characterize the disorder. Such an invasive surgical procedure is inherently 
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dangcrous and is therefore rarely utilized. As a result, the clinical misdiag- 
nosis of AD is estimated to be approximately 20%-30%. 

Thread Proteins 

The prototype thread protein molecule is pancreatic thread protein 
(PTP) which bears the unusual physical property of forming insoluble fibrils 
at neutral pH, but is highly soluble at acid or alkaline pH (Gross et al , 
supra). FTP is highly abundant, synthesized by pancreatic acinar cells, and 
secreted into pancreatic juice in concentrations exceeding 1 mg/ml (Id,). An 
increased thread protein immunoreactivity has been demonstrated in brains 
with AD lesions, using monoclonal antibodies to PTP (Ozturk et al. , Proc. 
Natl Acad. ScL USA 56:419-423 (1989)). In addition, a highly sensitive 
forward sandwich immunoradiometric assay was used to demonstrate that at 
least three distinct antigenic epitopes were shared between PTP and the related 
protein in the brain (Id.) Despite similarities, the pancreatic and neuronal 
forms of the thread protein are almost certainly distinct since the mRNA 
molecules and proteins differ in size, and many of the antigenic epitopes 
which are present in the pancreatic thread protein are not detectable in brain 
tissue (de la Monte et. al., J. Clin. Invest. 56:1004-1013 (1990); de la Monte 
et. al., J. Neurol. Sci. iiJ:152-164 (1992); de la Monte et al., Ann. Neurol. 
52:733-742 (1992)). 

The central nervous system form of the thread protein, designated 
hereafter as "neural thread protein w (NTP), has been identified in AD and 
Down's Syndrome brain tissue (Wands et al., International Application 
Publication No. WO 90/06993). NTP has been found in all AD brains studied 
where characteristic neuropathologic changes of the disease exist (Id.). The 
saline- extractable soluble immunoreactivity shares has a molecular weight of 
approximately 17 to 20 kD (Id.). 

Quantitative measurements of NTP immunoreactivity in various regions 
of AD brains revealed levels varying from 12 to 295 ng/gm tissue (Mean = 
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116 ng/gm tissue) compared to 1-11 ng/gm tissue (Mean = 5 ng/gra tissue) 
in comparable ares of control brains (Id.). 

Immunocytochemistry performed with monoclonal antibodies directed 
against the pancreatic form of FTP demonstrated that NTP is localized within 

5 cells, within fine processes within the neuropil, or is extracellular in both AD 

and Down's Syndrome brains (Id.). Two types of cell contain NTP: neurons 
and astrocytes (Id.). The affected neurons are the large pyramidal type which 
typically contain the neurofibrillary tangles well known in AD brain (Id.). 
That NTP accumulation within neurons is intrinsically important or 

10 integrally related to the evolution of AD lesions is corroborated by the 

presence of identical patterns of immunolabeling for NTP in Down's 
Syndrome brains, but not in control brains (Id.). It is important to note that 
the same structural abnormalities of AD occur in brains of all middle-age 
individuals with Down's syndrome, whether or not they are demented. There 

15 is also a higher incidence of AD in family members of Down's Syndrome 

patients. Moreover, the regional differences in the densities of NTP-contain- 
ing neurons parallels the density distributions of neurofibrillary tangles in both 
AD and Down's Syndrome. This provides further evidence that NTP is 
germane to the pathophysiology of AD. Whether NTP accumulates within 

20 neuronal perikarya, as a result of aberrant cellular metabolism or transport is 

not yet known. 

Summary of the Invention 

A need exists for a definitive diagnostic test which can be performed 
25 on individuals suspected of having, or being at risk for AD. The present 

invention satisfies such needs and provides further advantages. 

The manner in which these and other objects are realized by the 
present invention will be apparent from the summary and detailed description 
set forth below. 
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Unless defined otherwise* various terms used herein have the same 
meaning as is well understood in the art to which the invention belongs. All 
cited publications are incorporated herein by reference. 

This invention is directed to recombinant hosts expressing novel 
5 proteins associated with Alzheimer's Disease, neuroectodermal tumors, 

malignant astrocytomas, and glioblastomas. This invention is specifically 
directed to the recombinant hosts and vectors which contain the genes coding 
for the neuronal thread proteins (NTP) having molecular weights of about 8 
kDa f 14 fcDa, 17 kDa, 21 kDa, 26 kDa or 42 kDa. This invention is also 
10 directed to the substantially pure neural thread proteins, immunodiagnostic and 

molecular diagnostic methods to detect the presence of neural thread proteins, 
and the use of nucleic acid sequences which code for neural thread proteins 
in gene therapy. 

In particular, the invention includes a method for detecting and 
15 quantitating an NTP in a human subject, comprising: 

(a) contacting a biological sample from a human subject that 
is suspected of containing detectable levels of an NTP with a molecule capable 
of binding to the NTP; and 

(b) detecting the molecule bound to the NTP. 

20 The invention additionally includes the method as above, wherein the 

binding molecule is selected from the group consisting of: 

(a) an antibody substantially free of natural impurities; 

(b) a monoclonal antibody; and 

(c) a fragment of (a) or (b). 

25 The invention additionally includes the method as above, wherein the 

detecting molecule is detectably labeled and where a combination of such 

binding molecules is used. 

The invention additionally includes a method for detecting the presence 

of a genetic sequence coding for an NTP in a biological sample using a 
30 polynucleotide probe derived from a recombinant human NTP of this 

invention. 
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The invention additionally includes a method for determining the 
presence of a condition in a human subject, said condition including, but not 
limited to, the group consisting of Alzheimer's Disease, the presence of 
neuroectodermal tumors, the presence of malignant astrocytomas, and the 

5 presence of gliomas. 

The invention additionally includes a method of diagnosing the 
presence of AD in a human subject suspected of having AD which comprises: 

(a) incubating a biological sample from said subject suspected of 
containing an NTP with a molecule capable of identifying an NTP; and 

10 (b) detecting the molecule which is bound in the sample, wherein 

the detection indicates that the subject has AD. 
The invention additionally includes a method of diagnosing the 
presence of neuroectodermal tumors in a human subject suspected of having 
neuroectodermal tumors which comprises: 
15 (a) incubating a biological sample from said subject suspected of 

containing an NTP with a molecule capable of identifying an NTP; and 

(b) detecting the molecule which is bound in the sample, wherein 
the detection indicates that the subject has neuroectodermal tumors. 

The invention additionally includes a method of diagnosing the 
20 presence of a malignant astrocytoma in a human subject suspected of having 

a malignant astrocytoma which comprises: 

(a) incubating a biological sample from said subject, which is 
suspected of containing an NTP. in the presence of a binding molecule capable 
of identifying an NTP; and 
25 (b) detecting molecule which is bound in the sample, wherein the 

detection indicates that the subject has a malignant astrocytoma. 

The invention additionally includes a method of diagnosing the 
presence of a glioblastoma in a human subject suspected of having a 
glioblastoma which comprises: 
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(a) incubating a biological sample from said subject, which is 
suspected of containing an NTP, in the presence of a binding molecule capable 
of identifying an NTP; and 

(b) detecting molecule which is bound in the sample, wherein the 
detection indicates that the subject has a glioblastoma. 

The invention additionally includes the methods as above, wherein a 
biological sample is removed a human subject prior to contacting the sample 
with the molecule. 

The invention additionally includes the methods as above, wherein 
detecting any of the molecules bound to the protein is performed by in situ 
imaging. 

The invention additionally includes the methods as above, wherein 
detecting of any of the molecule bound to the protein is performed by in vivo 
imaging. 

The invention additionally includes the methods as above, wherein the 
biological sample is reacted with the binding molecule in a manner and under 
such conditions sufficient to determine the presence and the distribution of the 
protein. 

The invention additionally includes the methods as above, wherein a 
detectably labeled binding molecule of an NTP is administered to a human 
subject. 

The invention additionally includes the methods as above, wherein the 
binding molecule is bound to the protein in vivo. 

The invention additionally involves an NTP substantially free of any 
natural impurities and having a molecular weight of about 42 kDa. 

The invention additionally involves an NTP substantially free of any 
natural impurities and having a molecular weight of about 26 kDa. 

The invention additionally includes an NTP substantially free of any 
natural impurities and having a molecular weight of about 21 kDa. 
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The invention additionally includes an NTP substantially free of any 
natural impurities and having a molecular weight of about 17 kDa. 

The invention additionally includes an NTP substantially free of any 
natural impurities and having a molecular weight of about 14 kDa. 

The invention additionally includes an NTP substantially free of any 
natural impurities and having a molecular weight of about 8 kDa. 

The present invention also particularly relates to the diagnostic methods 
recited above, wherein the immunoassay comprises two different antibodies 
bound to a solid phase support combined with a third different detectably 

labeled antibody in solution. 

The invention is also directed to a method of producing an NTP, said 

method comprising: 

(a) culturing a recombinant host comprising a human gene 

coding for said NTP; and 

(b) isolating said NTP from said host. 

Additionally, the invention is directed to a substantially pure NTP 

obtained by the such a process. 

The invention is also directed to an 15- to 30-mer antisense 
oligonucleotide which is complementary to an NTP nucleic acid sequence and 
which is nonhomologous to PTP nucleic acid sequences, as well as 
pharmaceutical compositions comprising such oligonucleotides and a 
pharmaceutically acceptable carrier. 

The invention is also directed to ribozymes comprising a target 
sequence which is complementary to an NTP sequence and nonhomologous 
to PTP nucleic acid sequences, as well as pharmaceutical compositions 
comprising such ribozymes and a pharmaceutically acceptable carrier. 

The invention is also directed to a method of achieving pharmaceutical 
delivery of NTP molecules to the brain through acceptable carriers or 

expression vectors. 

The invention is also directed to oligodeoxynucleotides that form triple 
stranded regions with the various NTP genes (nucleic acid sequences) and 
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which arc nonhomologous to FTP nucleic acid sequences, as well as 
pharmaceutical compositions comprising such oligodeoxynucleotides and a 
pharmaceutically acceptable carrier. 

The invention is also directed to the therapeutic use of NTP-derived 
molecules or fragments thereof to modify or improve dementias of the 
Alzheimer's type of neuronal degeneration. 

The invention is also directed to methods for the differential diagnosis 
of sporadic and familial Alzheimer's disease. 

Brief Description of the Drawings 

Figures 1A-1J show neural thread protein immunoreactivity in CNS- 
derived tumors. 

Figure 2 depicts a graph showing neural thread protein levels in 
PNET1, PNET2, A172. C6, and Huh7 hepatocellular carcinoma cells 
measured by a forward sandwich monoclonal antibody-based 
immunoradiometric assay (M-IRMA). 

Figure 3 shows molecular size of neural thread proteins in SH-Sy5y, 
A172, and C6 cells demonstrated by immunoprecipitation and Western blot 
analysis using the Th9 monoclonal antibody. 

Figure 4 shows molecular sizes of neural thread proteins in PNET1 
cells (a) and C6 glioblastoma cells (b) demonstrated by pulse-chase metabolic 
labeling with »S-methionine, and immunoprecipitation with Th9 monoclonal 
antibody (Figure 4A). The molecular weights are 8, 14. 17, 21. 26 and 42 
kDa (arrows). 

Figures 5A-5E depict a series of five graphs showing the 21 kDa and 
17 kDa neural thread proteins in SH-Sy5y. PNET1, A172, and C6 cells and 
the absence thereof in Huh7 cells by SDS-PAGE/M-IRMA. 

Figure 6 depicts a gel showing that the 21 kDa neural thread protein 
in C6 glioblastoma cells is phosphorylated. 
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Figure 7 depicts a bar graph showing altered neural thread protein 
expression in PNET1 cells with growth phase. 

Figures 8A-8F show altered phenotype of PNET1 cells with cessation 
of cell growth and overnight serum starvation. 

Figure 9 shows the l-9a partial cDNA sequence, and Figure 9A shows 
a partial sequence of the second 5' anchor PCR product corresponding to the 
5' region of the l-9a cDNA (WP5' Sequence). 

Figure 10 shows alignment of partial sequences between l-9a and 
human PTP and the Reg gene (the nucleic acid sequence corresponding to the 
genomic clone of human PTP). 

Figure 10A shows alignment between l-9a and Exon 2 of the human 
Reg gene, and between the first 5' anchor PCR product of l-9a (WP03-417) 
and Exon 2 and Reg. 

Figure 10B shows alignment between the l-9a and its second 5' anchor 
PCR product (WP5') and AD 3-4 and AD 2-2 cDNAs. 

Figure 11A shows the partial nucleic acid and deduced amino acid 
sequences of the HB4 cDNA. Figures 11B and 11C show a protein 
hydrophilicity window plot. Hydrophilicity Window Size = 7; scale = Kyte- 
Doolittle. 

Figure 11D shows alignment between HB4 and human PTP. 

Figure 1 IE shows alignment between HB4 and human Reg gene. 

Figures 12A-12C show the expression of mRNA molecules 
corresponding to the l-9a CNS neural thread protein cDNA sequence in 
neuroectodermal tumor cell lines and in rat pancreas. 

Figures 13A and 13B show mRNA transcripts corresponding to the 
l-9a CNS neural thread protein cDNA sequence in human brain. This figure 
also demonstrates higher levels of l-9a CNS neural thread protein-related 
mRNAs in AD brains compared with aged-matched controls (Figure 13A). 
Figure 13B demonstrates four different transcripts with greater abundance of 
the lower molecular size mRNAs in AD compared with aged controls. 
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Figures 14A-14C show l-9a Southern blot analysis of RT/PCR-derived 
cDNAs in neuroectodermal cell lines. A- and B-PCR amplification of l-9a 
mRNA sequences in neuroectodermal cell lines, and using mRNA from 
newborn rat (NB) brain, AD brain, and aged control brain. Figure 14A is a 
5 longer exposure of Figure 14B. Figure 14C shows hybridization of the same 

blot using the 018 rat PTP probe. 

Figures 15A and 15B (SE-RT/PCR) show hybridization of the l-9a and 
018 probes with several clones isolated from SH-Sy5y cells by reverse 
transcribing mRNA and amplifying with primers corresponding to the known 
10 sequence of the l-9a partial cDNA. 

Figures 16A S 1.6D and 16E show the partial nucleic acid sequences of 
the AD 2-2 cDNAs isolated from the AD brain library. Figures 16B and i6C 
show a hydrophilicity window plot of AD2-2 T7. Hydrophilicity Window 
Size = 7; scale = Kyte-Doolittle. 
!5 Figures 16F, 161, 16J and 16K show the partial nucleic acid sequences 

of the AD 34 cDNAs isolated from the AD brain library. Figures 16G and 
16H show a hydrophilicity window plot of AD3-4. Hydrophilicity Window 
Size = 7; scale = Kyte-Doolittle. 

Figures 16L, 16M and 16N show the partial nucleic acid sequences of 
20 the AD 4-4 cDNAs isolated from the AD brain library. 

Figure 160 shows the partial nucleic acid sequences of the AD 16c 
(also called AD 10-7) cDNAs isolated from the AD brain library. Figures 
16P and 16Q show a hydrophilicity window plot of AD16c-T7. 
Hydrophilicity Window Size = 7; scale = Kyte-Dooiitde. 
25 Figure 16R shows the complete nucleotide sequence of the AD10-7 

cDNA clone that was isolated from an AD library. 

Figure 16S shows the complete nucleotide sequence of the AD16c 
cDNA clone that was isolated from the AD brain library. 

Figure 17 shows alignment of partial sequences between AD 2-2 and 
30 human Reg gene. 
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Figure 17A shows alignment of partial sequences between AD 2-2 and 
Exon 1 of Reg and rat FTP. 

Figure 17B shows alignment of partial sequences between AD 2-2 and 

l-9a. 

5 Figure 17C shows alignment of partial sequences between AD 2-2 and 

AD 16c. 

Figure 18 shows alignment of partial sequences between AD 3-4 (also 
called AD 5-3) and the Reg gene. 

Figure I S A shows alignment of partial sequences between AD 3-4 and 
10 the 5' anchor PCR products of the l-9a mRNA, termed WP03-5 and 18-4. 

Figure 18B shows alignment of partial sequences between AD 3-4 and 
the G2a-a EcdBlfPsfl genomic clone. 

Figure 19 shows alignment of partial sequences between AD 4-4 and 
AD 2-2 and l-9a (also called SE-4 corresponding to the PCR clone which is 

15 identical to l-9a). 

Figure 20 shows alignment of partial sequences between AD 16c and 

Reg gene. 

Figure 20A shows alignment of partial sequences between AD 16c and 
human PTP. 

20 Figure 20B shows alignment of partial sequences between AD 16c and 

AD 2-2. 

Figures 21 A-21D show a genomic Southern blot analysis using the AD 
3-4 as a probe; Figure 21B shows a similar pattern of hybridization on a 
genomic Southern using AD 2-2 as a probe. Figures 21A-21D show a 

25 Northern blot analysis of neuroectodermal tumor cell lines using AD 3-4 as 

a probe. The four cell lines that exhibit AD 3-4 transcripts are neuronal in 
phenotype; C6 glioma cell mRNA did not hybridize with the AD 3-4 probe. 
Figure 21D shows a Northern analysis of human AD and aged control brain 
temporal lobe tissue using the AD 3-4 probe, and demonstrates over- 

30 expression of the corresponding gene in AD (lanes labeled A) compared with 

aged control brains (lanes labeled C). 
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Figures 22. 22A, 22B. 22C, 22D, 22E, 22F. 22G and 22H show 
partial sequences of four genomic clones (isolated using both the l-9a cDNA 
and rat FTP 0-18 cDNA as probes. 

Figures 23 and 23A show the alignment of the G2a-2 Pstl partial 

sequence with the Reg gene. 

Figure 23B shows alignment of the G2a-2 Pstl-EcoBl sequence and the 

Reg gene and the rat FTP. 

Figures 23C and 23D show the alignment of the G5d-1 Pstl sequence 

and the Reg gene. 

Figures 24A-24D show neural thread protein expression by the l-9a 
cDNA (Figure 24A) and the G2a-2 Pstl genomic clone (Figure 24B). Figures 
24C and 24D show negative expression by the G5d-i EcoWPstl genomic 
clone, and pBluescript which lacks a cloned insert, respectively. 

Figures 25A and 25B depict a Northern blot analysis of AD16c mRNA 
in AD and aged control brains. The data shows elevated levels of AD16c 
mRNA expression in 6 of 9 AD compared to 1 of 6 age-matched controls. 

Figure 26 depicts a Western blot analysis of AD10-7 fusion proteins 
using monoclonal antibodies against the expressed tag protein (T7-tag mouse 
monoclonal antibodies. 

Figures 27A and 27B depict brightfield and darkfield microscopic 
analysis of the in situ hybridization of sense and antisense cRNA probes to 
human brain tissue sections of early AD. 



Definitions 

In the description that follows, a number of terms used in recombinant 
DNA technology are utilized extensively. In order to provide a clear and 
consistent understanding of the specification and claims, including the scope 
to be given such terms, the following definitions are provided. 

Cloning vector. A plasmid or phage DNA or other DNA sequence 
which is able to replicate autonomously in a host cell, and which is 
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characterized by one or a small number of restriction endonuclease recognition 
sites at which such DNA sequences may be cut in a determinable fashion 
without loss of an essential biological function of the vector, and into which 
a DNA fragment may be spliced in order to bring about its replication and 
cloning. The cloning vector may further contain a marker suitable for use in 
the identification of cells transformed with the cloning vector. Markers, for 
example, provide tetracycline resistance or ampicillin resistance. 

Expression vector. A vector similar to a cloning vector but which is 
capable of enhancing the expression of a gene which has been cloned into it, 
after transformation into a host. The cloned gene is usually placed under the 
control of (i.e. , operably linked to) certain control sequences such as promoter 
sequences. Promoter sequences may be either constitutive or inducible. 

Substantially pure. As used herein means that the desired purified 
protein is essentially free from contaminating cellular components, said 
components being associated with the desired protein in nature, as evidenced 
by a single band following polyacrylamide-sodium dodecyl sulfate gel 
electrophoresis. Contaminating cellular components may include, but are not 
limited to, proteinaceous, carbohydrate, or lipid impurities. 

The term "substantially pure" is further meant to describe a molecule 
which is homogeneous by one or more purity or homogeneity characteristics 
used by those of skill in the art. For example, a substantially pure NTP will 
show constant and reproducible characteristics within standard experimental 
deviations for parameters such as the following: molecular weight, 
chromatographic migration, amino acid composition, amino acid sequence, 
blocked or unblocked N-terminus, HPLC elution profile, biological activity, 
and other such parameters. The term, however, is not meant to exclude 
artificial or synthetic mixtures of the factor with other compounds. In 
addition, the term is not meant to exclude NTP fusion proteins isolated from 
a recombinant host. 

Recombinant Host. According to the invention, a recombinant host 
may be any prokaryotic or eukaryotic cell which contains the desired cloned 
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genes on an expression vector or cloning vector. This term is also meant to 
include those prokaryotic or eukaryotic cells that have been genetically 
engineered to contain the desired gene(s) in the chromosome or genome of 
that organism. 

Recombinant vector. Any cloning vector or expression vector which 
contains the desired cloned gene(s). 

Host. Any prokaryotic or eukaryotic cell that is the recipient of a 
replicable expression vector or cloning vector. A "host," as the term is used 
herein, also includes prokaryotic or eukaryotic cells that can be genetically 
engineered by well known techniques to contain desired gene(s) on its 
chromosome or genome. For examples of such hosts, see Sambrook et al., 
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, New York (1989). 

Promoter. A DNA sequence generally described as the 5' region of 
a gene, located proximal to the start codon. The transcription of an adjacent 
gene(s) is initiated at the promoter region. If a promoter is an inducible 
promoter, then the rate of transcription increases in response to an inducing 
agent. In contrast, the rate of transcription is not regulated by an inducing 
agent if the promoter is a constitutive promoter. 

Gene. A DNA sequence that contains information needed for 
expressing a polypeptide or protein. 

Structural gene. A DNA sequence that is transcribed into messenger 
RNA (mRNA) that is then translated into a sequence of amino acids 
characteristic of a specific polypeptide. 

Antisense RNA gene/Antisense RNA. In eukaryotes, mRNA is 
transcribed by RNA polymerase II. However, it is also known that one may 
construct a gene containing a RNA polymerase D template wherein a RNA 
sequence is transcribed which has a sequence complementary to that of a 
specific mRNA but is not normally translated. Such a gene construct is herein 
termed an "antisense RNA gene" and such a RNA transcript is termed an 
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"antisense RNA." Antisense RNAs are not normally translatable due to the 
presence of translation stop codons in the antisense RNA sequence. 

Antisense oligonucleotide. A DNA or RNA molecule containing a 
nucleotide sequence which is complementary to that of a specific mRNA. An 
antisense oligonucleotide binds to the complementary sequence in a specific 
mRNA and inhibits translation of the mRNA. 

Antisense Therapy. A method of treatment wherein antisense 
oligonucleotides are administered to a patient in order to inhibit the expression 
of the corresponding protein. 

Complementary DNA (cDNA). A "complementary DNA, " or "cDNA" 
gene includes recombinant genes synthesized by reverse transcription of 
mRNA and from which intervening sequences (introns) have been removed. 

Expression. Expression is the process by which a polypeptide is 
produced from a structural gene. The process involves transcription of the 
gene into mRNA and the translation of such mRNA into polypeptide(s). 

Homologous/Nonhomologous Two nucleic acid molecules are 
considered to be "homologous" if their nucleotide sequences share a similarity 
of greater than 50%, as determined by HASH-coding algorithms (Wilber, 
WJ. and Lipman, DJ., Proc. Natl. Acad. ScL «>:726-730 (1983)). Two 
nucleic acid molecules are considered to be -nonhomologous" if their 
nucleotide sequences share a similarity of less than 50%. 

Ribozyme. A ribozyme is an RNA molecule that contains a catalytic 
center. The term includes RNA enzymes, self-splicing RNAs, and self- 
cleaving RNAs. 

Ribozyme Therapy. A method of treatment wherein ribozyme is 
administered to a patient in order to inhibit the translation of the target 
mRNA. 

Fragment. A "fragment* of a molecule such as NTP is meant to refer 
to any polypeptide subset of that molecule. 

Functional Derivative. The term "functional derivatives" is intended 
to include the "variants," "analogues," or "chemical derivatives" of the 
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molecule. A "variant" of a molecule such as NTP is meant to refer to a 
naturally occurring molecule substantially similar to either the entire molecule, 
or a fragment thereof. An "analogue" of a molecule such as NTP is meant 
to refer to a non-natural molecule substantially similar to either the entire 
molecule or a fragment thereof. 

A molecule is said to be "substantially similar* to another molecule if 
the sequence of ammo acids in both molecules is substantially the same, and 
if both molecules possess a similar biological activity. Thus, provided that 
two molecules possess a similar activity, they are considered variants as that 
term is used herein even if one of the molecules contains additional amino 
acid residues not found in the other, or if the sequence of amino acid residues 
is not identical. 

As used herein, a molecule is said to be a "chemical derivative" of 
another molecule when it contains additional chemical moieties not normally 
a part of the molecule. Such moieties may improve the molecule's solubility, 
absorption, biological half-life, etc. The moieties may alternatively decrease 
the toxicity of the molecule, eliminate or attenuate any undesirable side effect 
of the molecule, etc. Examples of moieties capable of mediating such effects 
are disclosed in Remington's Pharmaceutical Sciences (1980) and will be 
apparent to those of ordinary skill in the ait. 

NTP. The term "NTP" refers to a family of neural thread proteins. 
The NTP family includes proteins with molecular weights of about 8 kDa, 14 
kDa, 17 kDa, 21 kDa, 26 kDa and 42 kDa, as described herein. 

ImmunO'Polymerase Chain Reaction. A method for the detection of 
antigens using specific antibody-DNA conjugates. According to this method, 
a linker molecule with bispecific binding affinity for DNA and antibodies is 
used to attach a DNA molecule specifically to an antigen-antibody complex. 
As a result, a specific antigen-antibody-DNA conjugate is formed. The 
attached DNA can be amplified by the polymerase chain reaction (PCR) using 
appropriate oligonucleotide primers. The presence of specific PCR products 
demonstrates that DNA molecules are attached specifically to antigen-antibody 
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complexes, thus indicating the presence of antigen. (Sano et al, Science 

255:120-122 (1992)). 

For example, Sano et al. supra, constructed a streptavidin-protein A 
chimera that possesses specific binding affinity for biotin and immunoglobulin 
5 G . TOs chimera (i.e., the -linker molecule") was used to attach a 

biotinylated DNA specifically to antigen-monoclonal antibody complexes that 
had been iinmobilized on microliter plate wells. A segment of the attached 
DNA was subsequently amplified by PCR. 

Detailed Description of the Invention 

10 This invention is directed to neural thread proteins (NTP), genetic 

sequences coding for an NTP mRNA or antisense mRNA, expression vectors 
confining the genetic sequences, recombinant hosts transformed therewith, 
and NTP and antisense RNA produced by such transformed recombinant host 
expression. This invention further relates to NTP ribozymes. and recombinant 

15 DNA molecules which code for NTP ribozymes and NTP antisense 

oligonucleotides. This invention further relates to antibodies directed against 
an NTP, as well as the use of NTP antibodies and NTP nucleic acid sequences 
for detection of the presence of an NTP in biological samples. The invention 
further relates to the use of NTP coding sequences in gene therapy. 

20 /. Isolation of DNA Sequences Coding for Neuronal Thread Proteins 

DNA sequences coding for an NTP are derived from a variety of 
sources. These sources include genomic DNA, cDNA, synthetic DNA, and 

combinations thereof. 

Human NTP genomic DNA can be extracted and purified from any 
25 human cell or tissue, by means well known in the art (for example, see 

Sambrook et al. , Molecular Cloning: A Laboratory Manual, Second Edition, 
Cold Spring Harbor Laboratory Press, 1989). The NTP genomic DNA of the 
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invcntion may or may not include naturally occurring introns. Moreover, 
such genomic DNA may be obtained in association with the 5' promoter 
region of the NTP gene sequences and/or with the 3' translational termination 
region. Further, such genomic DNA may be obtained in association with 
5 DNA sequences which encode the 5 ' nontranslated region of the NTP mRN A 

and/or with the genetic sequences which encode the 3' nontranslated region. 
To the extent that a host cell can recognize the transcriptional and/or 
translational regulatory signals associated with the expression of the mRNA 
and protein, then the 5' and/or 3' nontranscribed regions of the native gene, 

10 and/or, the 5' and/or 3' nontranslated regions of the mRNA, may be retained 

and employed for transcriptional and translational regulation. 

Alternatively, an NTP mRNA can be isolated from any ceil which 
expresses an NTP, and used to produce cDNA by means well known in the 
art (for example, see Sambrook et al., supra). Preferably, the mRNA 

15 preparation used will be enriched in mRNA coding for an NTP, either 

naturally, by isolation from cells which produce large amounts of an NTP, or 
in vitro, by techniques commonly used to enrich mRNA preparations for 
specific sequences, such as sucrose gradient centrifugation, or both. An NTP 
mRNA may be obtained from mammalian neuronal tissue, or from cell lines 

20 derived therefrom. Preferably, human cDNA libraries are constructed from 

17-18 week old fetal brain, 2 year old temporal lobe neocortex, end-stage AD 
cerebral cortex, or from cell lines derived from human neuronal tissue. Such 
cell lines may include, but are not limited to, central nervous system primitive 
neuroectodermal tumor cells (such as PNET1 or PNET2, as described herein), 

25 neuroblastoma cells (such as SH-Sy5y, as described herein), or human glioma 

cells (such as A172; ATCC CRL 1620). Alternatively, a rat cDNA library 
can be prepared from mRNA isolated from rat glioma cells, for example, C6 
rat glioma cells (ATCC CCL107). 

For cloning into a vector, suitable DNA preparations (either genomic 

30 or cDNA) are randomly sheared or enzymatically cleaved, respectively, and 

ligated into appropriate vectors to form a recombinant gene (either genomic 
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or cDNA) library. A DNA sequence encoding an NTP may be inserted into 
a vector in accordance with conventional techniques, including blunt-ending 
or staggered-ending termini for ligation, restriction enzyme digestion to 
provide appropriate termini, filling in of cohesive ends as appropriate, alkaline 
5 phosphatase treatment to avoid undesirable joining, and ligation with 

appropriate ligases. Techniques for such manipulation are disclosed by 
Sambrook et aL, supra, and are well known in the art. 

Libraries containing NTP clones may be screened and the NTP clones 
identified by any means which specifically selects for NTP DNA such as, for 

10 example: 1) by hybridization with an appropriate nucleic acid probe(s) 

containing a sequence specific for the DNA of this protein; or, 2) by 
hybridization-selected translational analysis in which native mRNA hybridizes 
to the clone in question, is translated in vitro, and the translation products are 
farther characterized; or, 3) if the cloned DNA sequences are themselves 

15 capable of expressing mRNA, by inununoprecipitation of a translated NTP 

product produced by the host containing the clone. 

Oligonucleotide probes specific for an NTP which can be used to 
identify clones to this protein can be designed from knowledge of the amino 
acid sequence of the corresponding NTP, or homologous regions of the PTP. 

20 Alternatively, oligonucleotide probes can be designed from knowledge of the 

nucleotide sequence of PTP (de la Monte et al, J. Clin. Invest 56:1004-1013 
(1990)). 

The suitable oligonucleotide, or set of oligonucleotides, which is 
capable of encoding a fragment of the NTP gene (or which is complementary 

25 to such an oligonucleotide, or set of oligonucleotides) may be synthesized by 

means well known in the art (for example, see Sambrook et aL, supra). 
Techniques of nucleic acid hybridization and clone identification are disclosed 
by Sambrook et al. $ supra. Those members of the above-described gene 
library which are found to be capable of such hybridization are then analyzed 

30 to determine the extent and nature of the NTP encoding sequences which they 

contain. 
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To facilitate the detection of the desired NTP coding sequence, the 
above-described DNA probe is labeled with a detectable group. Such 
detectable group can be any material having a detectable physical or chemical 
property. Such materials have been well-developed in the field of nucleic acid 
5 hybridization and in general most any label useful in such methods can be 

applied to the present invention. Particularly useful are radioactive labels 
including M P. J H, U C, m I, or the like. Any radioactive label may be 
employed which provides for an adequate signal and has sufficient half-life. 
The DNA probe may be labeled, for example, by nick-translation, by T4 
10 DNA polymerase replacement synthesis, or by random priming, among other 

methods well known in the art (see Sambrook el al. supra). 

Alternatively, DNA probes can be labeled with non-radioactive 
markers such as biotin, an enzyme, or fluorescent group. 

In an alternative method of cloning NTP DNA sequences, NTP cDNAs 
15 are obtained by direct cloning of cDNAs from cell lines and brain tissue, 

using the 3'- and 5 '-RACE methods, as described herein. Preferably, a 
human neuroectodermal tumor cell line or AD brain tissue is used as a source 
ofmRNA. 

//. Expressing the Gene Coding for NTP 

20 The above-discussed methods are, therefore, capable of identifying 

DNA sequences which are code for an NTP or fragments thereof. In order 
to further characterize such DNA sequences, and in order to produce the 
recombinant protein, it is desirable to express the proteins which the DNA 
sequences encode. 

25 To express an NTP, transcriptional and translational signals 

recognizable by an appropriate host are necessary. The cloned NTP DNA 
sequences, obtained through the methods described above, and preferably in 
double-stranded form, may be "operably linked" to sequences controlling 
transcriptional expression in an expression vector, and introduced into a host 
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cell, cither prokaryotic or eukaryotic, to produce recombinant NTP. 
Depending upon which strand of the NTP coding sequence is operably linked 
to the sequences controlling transcriptional expression, it is also possible to 
express an NTP antisense RNA. 

Expression of the NTP in different hosts may result in different post- 
translational modifications which may alter the properties of the NTP. 
Preferably, the present invention encompasses the expression of an NTP in 
eukaryotic cells, and especially mammalian, insect, and yeast cells. 
B^cd^va^o^^hB^nmia^^ Manimalian cells 
provide post-translational modifications to recombinant NTP which include 
folding and/or phosphorylation. Most preferably, mammalian host cells 
include human CNS primitive neuroectodermal tumor cells, human 
neuroblastoma cells, human glioma cells, or rat glioma cells. Especially 
preferred primitive neuroectodermal tumor cells include PNET1 and PNETZ, 
especially preferred human glioblastoma cells include Hgl6 and Hgl7. 
especially preferred human glioma cells include A172, and especially 
preferred rat glioma cells include C6 (see Example 1). 

Alternatively, an NTP may be expressed by prokaryotic host cells. 
Preferably, a recombinant NTP is expressed by such cells as a fusion protein, 
as described herein. An especially preferred prokaryotic host is E. coli. 
Preferred strains of E. coli include Y1088. Y1089. CSH18, ER1451, and 
ER1647 (see, for example, Molecular Biology LabFax, Brown. T.A., Ed.. 
Academic Press. New York (1991)). An alternative preferred host is Bacillus 
subtilus, including such strains as BR151. YB886. MI119, MI120. and B170 
(see, for example. Hardy. "Bacillus Cloning Methods/ in DNA Cloning: A 
Practical Approach, IRL Press, Washington, D.C. (1985)). 

A nucleic acid molecule, such as DNA. is said to be "capable of 
expressing" a polypeptide if it contains expression control sequences which in 
turn contain transcriptional regulatory information and such sequences are 
"operably linked" to the nucleotide sequence which encodes the protein. 
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Two sequences of a nucleic acid molecule are said to be operably 
linked when they are linked to each other in a manner which either permits 
both sequences to be transcribed onto the same RNA transcript, or permits an 
RNA transcript, begun in one sequence to be extended into the second 
sequence. Thus, two sequences, such as a promoter sequence and any other 
"second" sequence of DNA or RNA are operably linked if transcription 
commencing in the promoter sequence will produce an RNA transcript of the 
operably linked second sequence. In order to be operably linked it is not 
necessary that tv/o sequences be immediately adjacent to one another. 

The promoter sequences of the present invention may be either 
prokaiyotic, eukaryotic or viral. Suitable promoters are repressive, 
constitutive, or inducible. Examples of suitable prokaryotic promoters include 
promoters capable of recognizing the T4 polymerases (Malik et al, 7. Biol. 
Chem. 265:1174-1181 (1984); Rosenberg et al, Gene 59:191-200 (1987); 
Shinedling et al, J. Molec. Biol i95:47M80 (1987); Hu et al, Gene 42:21- 
30 (1986)), T3, Sp6, and T7 (Chamberlin et al, Nature 225:227-231 (1970); 
Bailey etal, Proc. Natl Acad. ScL (U.S.A.) 50:2814-2818 (1983); Davanloo 
etal. Proc. Natl Acad. ScL (U.S.A.) 57:2035-2039 (1984)); the P R and P L 
promoters of bacteriophage lambda (The Bacteriophage Lambda, Hershey, 
A.D., Ed., Cold Spring Harbor Press, Cold Spring Harbor, NY (1973); 
Lambda 77, Hendrix, R.W., Ed., Cold Spring Harbor Press, Cold Spring 
Haibor, NY (1980)); the trp, recA, heat shock, and lacZ promoters of E. coli; 
the of-amylase (Ulmanen et al, J. Bacteriol. 762:176-182 (1985)) and the 
delta-28-specific promoters of B. subtilis (Gilman et al, Gene 52:11-20 
(1984)); the promoters of the bacteriophages of Bacillus (Gryczan, In: The 
Molecular Biology of the Bacilli, Academic Press, Inc., NY (1982)); 
Streptomyces promoters (Ward et al, Mol Gen. Genet. 203:468-478 (1986)); 
the int promoter of bacteriophage lambda; the bla promoter of the C-lactamase 
gene of pBR322, and the CAT promoter of the chloramphenicol acetyl trans- 
ferase gene of pBR325, etc. Prokaiyotic promoters are reviewed by Glick, 
/. Ind. Microbiol 7:277-282 (1987); Cenatiempo, Biochimie 65:505-516 
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(1986); Watson et al.. In: Molecular Biology of the Gene, Fourth Edition, 

Benjamin Cummins, Mcnlo Park, CA (1987); Gottesman, Ann. Rev. Genet. 

J&415-442 (1984); and Sambrook et al., supra. 

Preferred eukaryotic promoters include the promoter of the mouse 
5 metallothionein I gene (Hamer et al. J. Mol. Appl. Gen. 1 :273-288 (1982)); 

the TK promoter of Herpes virus (McKnight, Cell 57:355-365 (1982)); the 

SV40 early promoter (Benoist, et al.. Nature (London) 290:304-310 (1981)); 

and the yeast gal4 gene promoter (Johnston, et al, Proc. Natl. Acad. Set. 

(USA) 79:6971-6975 (1982); Silver, et al., Proc. Natl. Acad. Sci. (USA) 
10 51:5951-5955 (1984)). All of the above listed references are incorporated by 

reference herein. 

Strong promoters are the most preferred promoters of the present 
invention. Examples of such preferred promoters are those which recognize 
the T3, SP6 and T7 polymerase promoters; the P t promoter of bacteriophage 

15 lambda; the recA promoter and the promoter of the mouse metallothionein I 

gene. The most preferred promoter for expression in prokaryotic cells is one 
which is capable of recognizing the T7 polymerase promoter. The sequences 
of such polymerase recognition sequences are disclosed by Watson, et al. (In: 
Molecular Biology of the Gene, Fourth Edition, Benjamin Cummins, Menlo 

20 Park, CA, (1987)). The most preferred promoter for expression in 

mammalian cells is SV40 (Gorman, "High Efficiency Gene Transfer into 
Mammalian cells," in DM4 Cloning: A Practical Approach, Volume H, IPX 
Press. Washington, D.C., pp. 143-190 (1985)). 



III. Methods of Detecting NTP 

This invention is directed towards methods of detecting neurological 
disease in a human subject, utilizing the nucleic acid probes hybridizable to 
NTP genes or transcripts, or antibodies specific for an NTP. By "neurological 
disease" is meant Alzheimer's Disease (AD), or other neurodegenerative 
disorders with the Alzheimer's type pathogenic changes (for example, 
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Paridnson's disease with AD-type neurodegeneration), as well as 
neuroectodermal tumors, malignant astrocytomas, and glioblastomas. By 
•human subject" is meant any human being or any developmental form 
thereof, such as a human embryo or fetus, prior to birth. The diagnostic 
5 methods of the present invention do not require invasive removal of neural 

tissue. 

The present invention additionally pertains to assays, both nucleic acid 
hybridization assays and immunoassays, for detecting the presence of NTP in 
cells or in the biological fluids of a human subject using light or electron 
10 microscopic histology, imaging, radioactive or enzyme based assays, and the 

like. 

A. Nucleic Acid Hybridization Assays 

In testing a tissue sample for an NTP using a nucleic acid hybridization 
assay, RNA can be isolated from tissue by sectioning on a cryostat and lysing 
the sections with a detergent such as SDS and a chelating agent such as 
EDTA, optionally with overnight digestion with proteinase K (50 /ig/ml). 
Such tissue is obtained by autopsy and biopsy. A preferred quantity of tissue 
is in the range of 1-10 milligrams. Protein is removed by phenol and 
chloroform extractions, and nucleic acids arc precipitated with ethanol. RNA 
is isolated by chromatography on an oligo dT column and then eluted 
therefrom. Further fractionation can also be carried out, according to methods 
well known to those of ordinary skill in the art. 

A number of techniques for molecular hybridization are used for the 
detection of DNA or RNA sequences in tissues; each has certain advantages 
and disadvantages. When large amounts of tissue are available, analysis of 
hybridization kinetics provides the opportunity to accurately quantitate the 
amount of DNA or RNA present, as well as to distinguish sequences that are 
closely related but not identical to the probe, and determine the percent 
homology. 
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Reactions are run under conditions of hybridization (Tm-25°C) in 
which the rate of rcassociation of the probe is optimal (Wetmur et al.. J. Mol. 
Biol. 32:349-370(1968)). The kinetics of the reaction are second- order when 
the sequences in the tissue arc identical to those of the probe; however, the 
reaction exhibits complex kinetics when probe sequences have partial 
homology to those in the tissue (Sharp et oL. J. Mol. Biol 36:709-726 
(1974)). 

The ratio of probe to cell RNA is determined by the sensitivity desired. 
To detect one transcript per cell would require about 100 pg of probe per |tg 
of total cellular DNA or RNA. The nucleic acids are mixed, denatured, 
brought to the appropriate salt concentration and temperature, and allowed to 
hybridize for various periods of time. The rate of rcassociation can be 
determined by quantitating the amount of probe hybridized either by hydroxy 
apatite chromatography (Britten et al. Science 767:529-540 (1968)) or SI 
nuclease digestion (Sutton, Biochim. Biophys. Acta 240:522-531 (1971)). 

A more flexible method of hybridization is the northern blot technique. 
This technique offers variability in the stringency of the hybridization reaction, 
as well as determination of the state of the retroviral sequences in the 
specimen under analysis. Northern analysis can be performed as described 
herein. 

A major consideration associated with hybridization analysis of DNA 
or RNA sequences is the degree of relatedness the probe has with the 
sequences present in the specimen under study. This is important with the 
blotting technique, since a moderate degree of sequence homology under 
nonstringent conditions of hybridization can yield a strong signal even though 
the probe and sequences in the sample represent non-homologous genes. 

The particular hybridization technique is not essential to the invention, 
any technique commonly used in the art being within the scope of the present 
invention. Typical probe technology is described in United States Patent 
4,358,535 to Falkow et al. , incorporated by reference herein. For example, 
hybridization can be carried out in a solution containing 6 x SSC (10 x SSC: 
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L5 M sodium chloride, 0.15 M sodium citrate, pH 7.0), 5 x Denhardt's (1 
x Denhardt's: 0.2% bovine serum albumin, 0.2% polyvinylpyrrolidone, 
0.02% FicoU 400), 10 mM EDTA, 0.5% SDS and about 10 7 cpm of nick- 
translated DNA for 16 hours at 65°C 

the labeled probes, as described above, provide a general diagnostic 
method for detection of an NTP in tissue. The method is reasonably rapid, 
has a simple protocol, has reagents which can be standardized and provided 
as commercial kits, and allows for rapid screening of large numbers of 
samples. 

In one method for carrying out the procedure, a clinical isolate 
containing RNA transcripts is fixed to a support. The affixed nucleic acid is 
contacted with a labeled polynucleotide having a base sequence complementary 
or homologous to the coding strand of the NTP gene. 

The hybridization assays of the present invention are particularly well 
suited for preparation and commercialization in kit form, the kit comprising 
a carrier means compartmentalized to receive one or more container means 
(vial, test tube, etc.) in close confinement, each of said container means 
comprising one of the separate elements to be used in hybridization assay. 

For example, there may be a container means containing NTP cDNA 
molecules suitable for labeling by "nick translation" (see, for example, 
Sambrook et al, supra, for standard methodology), or labeled NTP cDNA or 
RNA molecules. Further container means may contain standard solutions for 
nick translation of NTP cDNA comprising DNA polymerase I/DNase I and 
unlabeled deoxyribonucleotides (i.e., dCTP, dTTP, dGTP, and dATP), 

The presence of NTP RNA is determined by the variation in the 
appearance and/or quantity of probe-related RNA in tested tissue. 

The DNA probes of this invention can also be used for differential 
diagnosis of hereditary or familial AD and non-hereditary or sporadic AD. 
The familial form of AD often occurs at an earlier age and is associated with 
Down's syndrome in the family. Thus, a genetic test for familial AD allows 
for genetic counseling of families. While much effort has been directed 
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toward characterizing a genetic marker for familial AD (Gusella, FASEB J 
3:2036-2041 (1989); Hooper, / NIH Res. 4:48-54 (1992)), genetic linkage 
analysis only identifies a genetic marker sequence without providing the 
knowledge of the function of the genomic sequence. In contrast, the cDNA 

5 probes described herein and obtained from individuals with sporatic AD 

encode a known protein of known function which is over-expressed in brain 
tissue of patients with AD. 

Most cases of the AD disorder appear to be the sporadic form, 
although there are well-documented familial cases (Gusella, supra; Harrison's 

10 Principles of Internal Medicine, Braunwald et al , Eds., Eleventh Edition, Mc- 

Graw-Hill Book Company, New York, pp. 2012-2013 (1987)). A patient with 
familial AD, unlike a patient with sporadic AD, inherited the predisposing 
mutation through the germ cells. Some of the familial cases have been shown 
to follow an autosomal dominant pattern of inheritance (Id.). Thus, the DNA 

15 of a patient with familial AD will contain the inherited genetic alteration 

which is absent from the DNA of a patient with sporadic AD. 

A method of differentiating between sporadic and familial AD in a 
human subject involves obtaining a biological sample from the human subject 
who is suspected of having Alzheimer's Disease. Then, DNA is purified from 

20 the biological sample. Finally, the DNA is contacted with a NTP DNA probe 

under conditions of hybridization. Familial AD is indicated by the detection 
of a hybrid of the probe and the DNA, whereas sporadic AD is indicated by 
the absence of detection of hybridization. 

For example, the biological sample can be a blood sample which is 

25 subjected to differential centrifugation to enrich for white blood cells within 

three days of collection (Park, W PCR in the Diagnosis of Retinoblastoma," in 
PCR Protocols, Innis et al., Eds., Academic Press, Inc., New York, pp. 407- 
415 (1990)). The DNA sample can be prepared using the sodium N- 
lauroylsarcosine-Proteinase K, phenol, and RNase method (Sambrook et al, 

30 supra). DNA analysis can be performed by digesting the DNA sample, 

preferably 5 micrograms, with a restriction endonuclease (such as HindUl). 
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Digested DNA is then fractionated using agarose gel electrophoresis, 
preferably, a 1% horizontal agarose gel, for 18 hours in a buffer preferably 
containing 89 mM Tris-Hcl (pH 8), 89 mM sodium borate and 2 mM EDTA 
(Gusella et a/.. Nature 306:234-238 (1983)). Southern analysis can be 
5 performed using conventional techniques (Sambrook et al. t supra), and the 

labelled AD cDNA probes can be hybridized under conditions described 
above. The preferred DNA probes for this differential diagnosis method 
include l-9a ( AD3-4, AD+4 and G2-2 Pstl. 

B. Immunoassays 

10 Antibodies directed against an NTP can be used, as taught by the 

present invention, to detect and diagnose AD. Various histological staining 
methods, including immunohistochemical staining methods, may also be used 
effectively according to the teaching of the invention. Silver stein is but one 
method of visualizing NTP. Other staining methods useful in the present 

15 invention will be obvious to the artisan, the determination of which would not 

involve undue experimentation (see generally, for example, A Textbook of 
Histology, Eds. Bloom and Fawcett, W.B. Saunders Co., Philadelphia 
(1964)), 

One screening method for determining whether a given compound is 
20 an NTP functional derivative comprises, for example, immunoassays 

employing radioimmunoassay (RIA) or enzyme-linked immunosorbant assay 
(EUSA) methodologies, based on the production of specific antibodies 
(monoclonal or polyclonal) to an NTP. For these assays, biological samples 
are obtained by venepuncture (blood), spinal tap (cerebral spinal fluid (CSF)), 
25 urine and other body secretions such as sweat and tears. For example, in one 

form of RIA, the substance under test is mixed with diluted antiserum in the 
presence of radiolabeled antigen. In this method, the concentration of the test 
substance will be inversely proportional to the amount of labeled antigen 
bound to the specific antibody and directly related to the amount of free 
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labeled antigen. Other suitable screening methods will be readily apparent to 

those of skill in the art. 

The present invention also relates to methods of detecting an NTP or 
functional derivatives in a sample or subject For example, antibodies specific 

5 for an NTP, or a functional derivative, may be detectably labeled with any 

appropriate marker, for example, a radioisotope, an enzyme, a fluorescent 
label, a paramagnetic label, or a free radical. 

Alternatively, antibodies specific for an NTP. or a functional deriva- 
tive, may be detectably labeled with DNA by the technique of immuno- 

10 polymerase chain reaction (Sano et al.. Science 258: 120-122 (1992)). The 

polymerase chain reaction (PCR) procedure amplifies specific nucleic acid 
sequences through a series of manipulations including denaturation. annealing 
of oligonucleotide primers, and extension of the primers with DNA 
polymerase (see, for example. Mullis et al., U.S. Patent No. 4,683,202; 

15 Mullis et al., U.S. Patent No. 4,683,195; Loh et al. Science 243:211 

(1988) ). The steps can be repeated many times, resulting in a large 
amplification of the number of copies of the original specific sequence. As 
little as a single copy of a DNA sequence can be amplified to produce 
hundreds of nanograms of product (Li « al.. Nature 355:414 (1988)). Other 

20 known nucleic acid amplification procedures include transcription-based 

amplification systems (Kwoh et al.. Proc. Natl. Acad. Sci. USA S6:1173 

(1989) ; Gingeras et al., WO 88/10315), and the "ligase chain reaction" in 
which two (or more) oligonucleotides are ligated in the presence of a nucleic 
acid target having the sequence of the resulting "di-oligonucleotide" thereby 

25 amplifying the di-oligonucleotide (Wu et al.. Genomics 4:560 (1989); 

Backman et al., EP 320,308; Wallace, EP 336.731; Orgel. WO 89/09835). 
For example, the immuno-PCR assay can be carried out by 

immobilizing various amounts of the test material on the surface of microliter 

wells (see Sanzo et al.. supra, page 122, footnote 7). The wells are 
30 subsequently incubated with an NTP monoclonal antibody, washed, and then 

incubated with biotinylated NTP DNA molecules which have been conjugated 
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to stieptavidin-protein chimera (Id.). This chimera binds biotin (via the 
streptavidin moiety) and the Fc portion of an immunoglobulin G molecule (via 
the protein A moiety) (Id., at 120; Sanzo et al., Bio/Technology 9:1378 
(1991))- The wells are then washed to remove unbound conjugates. Any 
NTP present in the test material will be bound by the NTP monoclonal 
antibody, which in turn, is bound by the protein A moiety of the biotinylated 
NTP DNA - streptavidin-protein A conjugate. Then, the NTP DNA 
sequences are amplified using PGR.* Briefly, the microtiter wells are 
incubated with deoxyribonucleoside triphosphates, NTP oligonucleotide 
primers, and Taq DNA polymerase (see Sanzo et al., supra, page 122, 
footnote 1 1). An automated thermal cycler (such as the PTC-100-96 Thermal 
Cycler, MJ Research, Inc.) can be used to perform PCR under standard 
conditions (Id.). The PCR products are then analyzed by agarose gel 
electrophoresis after staining with ethidium bromide. 

Methods of making and detecting such detectably labeled antibodies or 
their functional derivatives are well known to those of ordinary skill in the art, 
and are described in more detail below. Standard reference works setting 
forth the general principles of immunology include the work of Klein 
(Immunology: The Science of Self-Nonself Discrimination, John Wiley & 
Sons, New York (1982)); Kennett et al. (Monoclonal Antibodies and 
Hybridomas: A New Dimension in Biological Analyses, Plenum Press, New 
York (1980)); Campbell ("Monoclonal Antibody Technology/ In: Laboratory 
Techniques in Biochemistry and Molecular Biology, Volume 13 (Buidon, R. f 
et aL, eds.) ( Elsevier, Amsterdam (1984)); and Eisen (In: Microbiology, 3rd 
Ed. (Davis, et al, Harper & Row, Philadelphia (1980)). 

The term "antibody" refers both to monoclonal antibodies which are 
a substantially homogeneous population and to polyclonal antibodies which are 
heterogeneous populations. Polyclonal antibodies are derived from the sera 
of animals immunized with an antigen. Monoclonal antibodies (mAbs) to 
specific antigens may be obtained by methods known to those skilled in the 
art. See, for example, Kohler and Milstein, Nature 256:495-497 (1975) and 
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U.S. Patent No. 4,376,110. Such antibodies may be of any immunoglobulin 
class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. 

The monoclonal antibodies, particularly mAbs Th7, Th9, and ThlO 
used in the present invention, may be prepared as previously described (Gross 

5 etal., J. Clin. Invest. 76:11 15-2126 (1985); Ozturk et al , Proc. Natl Acad. 

Sci. USA 56:419-423 (1989); de la Monte et. al, J. Clin. Invest. 56:1004- 
1013 (1990); de la Monte et. al.. J. Neurol. Sci. 2/5:152-164 (1992); de la 
Monte et al, Ann. Neurol. 32:733-142 (1992)). The Th monoclonal 
antibodies were generated against the purified pancreatic form of thread 

10 protein (Id.). NTP-specific polyclonal and monoclonal antibodies can also be 

generated against a substantially pure NTP isolated from recombinant hosts 
(for example, see Carroll et al, "Production and Purification of Polyclonal 
Antibodies to the Foreign Segment of B-Galactosidase Fusion Proteins," in 
DNA Cloning: A Practical Approach. Volume IE, IRL Press, Washington, 

15 D.C., pp. 89-111 (1987); Mole et al. "Production of Monoclonal Antibodies 

Against Fusion Proteins Produced in Escherichia coli,' mDNA Cloning: A 
Practical Approach, Volume in. IRL Press, Washington, D.C., pp. 113-1139 
(1987)). Alternatively, NTP-specific polyclonal and monoclonal antibodies 
can be generated against a substantially pure NTP isolated from biological 

20 material such as brain tissue and cell lines, by using well known techniques. 

For example, monoclonal antibodies specific for the various NTP 
molecules of approximately, 8, 14, 17, 21, 26 kDa and 42 kDa molecular 
weights may be prepared from recombinant-derived proteins, which are 
expressed, isolated and purified from the cDNA (i.e., l-9a), genomic clones 

25 (G2-2 PstT) and AD-NTP 3-4 cDNA clones. These NTP molecules are 

derived from the above cDNA's and genomic clones, inserted and produced 
in suitable expression vectors (see Figures 2A and 2B). Since there are 
regions of 60-70% homology in the 5' ends of the l-9a NTP cDNA and PTP, 
one can obtain monoclonal antibodies that bind specifically to the NTP 

30 recombinant proteins and not to the pancreatic form by performing routine 

differential screening (see. for example, de la Monte et al. J. Clin. Invest. 
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86: 1004-1013 (1990)). Although there will be monoclonal antibodies that 
bind to both NTP and FTP, it will be possible to generate NTP-speciffc 
monoclonal antibodies because there is a substantial sequence divergence 
between NTP molecules of various forms (e.g., 8, 14, 17, 21, 26 and 42 kDa) 

5 and because an epitope may be defined by as few as 6-8 amino acids. 

The terra "antibody" is also meant to include both intact molecules as 
well as fragments thereof, such as, for example, Fab and F(ab') 2 . which are 
capable of binding antigen. Fab and F(ab')j fragments lack the Fc fragment 
of intact antibody, clear more rapidly from the circulation, and may have less 

10 non-specific tissue binding than an intact antibody (Wahl et al.. J. Nucl. Med. 

24:316-325 (1983)). 

It will be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies useful in the present invention may be used for the detection and 
quantitation of an NTP according to the methods disclosed herein in order to 
15 detect and diagnose AD in the same manner as an intact antibody. Such 

fragments are typically produced by proteolytic cleavage, using enzymes such 
as papain (to produce Fab fragments) or pepsin (to produce F(ab') I 
fragments). 

An antibody is said to be "capable of binding" a molecule if it is 
20 capable of specifically reacting with the molecule to thereby bind the molecule 

to the antibody. The term "epitope" is meant to refer to that portion of any 
molecule capable of being bound by an antibody which can also be recognized 
by that antibody. Epitopic determinants usually consist of chemically active 
surface groupings of molecules such as amino acids or sugar side chains and 
25 have specific three dimensional structural characteristics as well as specific 

charge characteristics. 

An "antigen" is a molecule capable of being bound by an antibody 
which is additionally capable of inducing an animal to produce antibody 
capable of binding.to an epitope of that antigen. An antigen may have one, 
30 or more than one epitope. The specific reaction referred to above is meant 

to indicate that the antigen will react, in a highly selective manner, with its 
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corresponding antibody and not with the multitude of other antibodies which 
may be evoked by other antigens. 

The antibodies, or fragments of antibodies, useful in the present 
invention may be used to quantitatively or qualitatively detect the presence of 
cells which contain the NTP antigens. Thus, the antibodies (or fragments 
thereof) useful in the present invention may be employed histologically to 
detect or visualize the presence of an NTP. 

Such an assay for an NTP typically comprises incubating a biological 
sample from said subject suspected of having such a condition in the presence 
of a detectably labeled binding molecule (e.g. , antibody) capable of idenufying 
an NTP, and detecting said binding molecule which is bound in a sample. 

Thus, in this aspect of the invention, a biological sample may be 
treated with nitrocellulose, or other solid support which is capable of 
immobilizing cells, cell particles or soluble proteins. The support may then 
be washed with suitable buffers followed by treatment with the detectably 
labeled NTP-specific antibody. The solid phase support may then be washed 
with the buffer a second time to remove unbound antibody. The amount of 
bound label on said solid support may then be detected by conventional 
means. 

By "solid phase support" is intended any support capable of binding 
antigen or antibodies. Well-known supports, or carriers, include glass, poly- 
styrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, agaroses, and magnetite. The nature of 
the carrier can be either soluble to some extent or insoluble for the purposes 
of the present invention. The support material may have virtually any 
possible structural configuration so long as the coupled molecule is capable of 
binding to an antigen or antibody. Thus, the support configuration may be 
spherical, as in a bead, or cylindrical, as in the inside surface of a test tube, 
or the external surface of a rod. Alternatively, the surface may be flat such 
as a sheet, test strip, etc. Preferred supports include polystyrene beads. 
Those skilled in the art will note many other suitable carriers for binding 
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monoclonal antibody or antigen, or will be able to ascertain the same by use 
of routine experimentation. 

One embodiment for carrying out the diagnostic assay of the present 
invention on a biological sample containing an NTP, comprises: 

(a) contacting a detectably labeled NTP-specific antibody 
with a solid support to effect immobilization of said NTT-specific antibody or 
a fragment thereof; 

(b) contacting a sample suspected of containing an NTP with 

said solid support; 

(c) incubating said detectably labeled NTP-specific antibody 
with said support for a time sufficient to allow the immobilized NTP-specific 
antibody to bind to the NTP; 

(d) separating the solid phase support from the incubation 

mixture obtained in step (c); and 

(e) detecting the bound label and thereby detecting and 

quantifying NTP, 

Alternatively, labeled NTP-specific antibody/NTP complexes in a 
sample may be separated from a reaction mixture by contacting the complex 
with an immobilized antibody or protein which is specific for an 
immunoglobulin, e.g.. Staphylococcus protein A, Staphylococcus protein G t 
anti-IgM or anti-IgG antibodies. Such anti-immunoglobulin antibodies may 
be polyclonal, but are preferably monoclonal. The solid support may then be 
washed with a suitable buffer to give an immobilized NTP/labeled NTP- 
specific antibody complex. The label may then be detected to give a measure 
of an NTP. 

This aspect of the invention relates to a method for detecting an NTP 
or a fragment thereof in a sample comprising: 

(a) contacting a sample suspected of containing an NTP with 
an NTP-specific antibody or fragment thereof which binds to NTP; and 

(b) detecting whether a complex is formed. 
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The invention also relates to a method of detecting an NTT in a 

sample, farther comprising: 

(c) contacting the mixture obtained in step (a) with an Fc 
binding molecule, such as an antibody, Staphylococcus protein A, or 
Staphylococcus protein O, which is irnmobilized on a solid phase support and 
is specific for the NTP-specific antibody to give a NTP/NTP-specific antibody 

immobilized antibody complex; 

(d) washing the solid phase support obtained in step (c) to 

remove unbound KTP/NTP-specific antibody complex; 

(e) and detecting the label bound to said solid support. 
Of course, the specific concentrations of detectably labeled antibody 

and NTP, the temperature and time of incubation, as well as other assay 
conditions may be varied, depending on various factors including the 
concentration of an NTP in the sample, the nature of the sample, and the like. 
The binding activity of a given lot of anti-NTP antibody may be determined 
according to well known methods. Those skilled in the art will be able to 
detennine operative and optimal assay conditions for each determination by 
employing routine experimentation. 

Other such steps as washing, stirring, shaking, filtering and the like 
may be added to the assays as is customary or necessary for the particular 
situation. 

One of the ways in which the NTP-specific antibody can be detectably 
labeled is by linking the same to an enzyme. This enzyme, in turn, when 
later exposed to its substrate, will react with the substrate in such a manner 
as to produce a chemical moiety which can be detected, for example, by 
spectrophotometry fluorometric or by visual means. Enzymes which can be 
used to detectably label the NTP-specific antibody include, but are not limited 
to, malate dehydrogenase, staphylococcal nuclease, delta-V-steroid isoraerase, 
yeast alcohol dehydrogenase, a-glycerophosphate dehydrogenase, triose 
phosphate isomerase, horseradish peroxidase, alkaline phosphatase. 
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asparaginase, glucose oxidase, fl-galactosidase, ribonuclease, urease, catalase, 
glucose-VI-phosphate dehydrogenase, glucoamylase and acetylcholinesterase. 
Detection may be accomplished using any of a variety of immuno- 
assays. For example, by radioactively labeling the NTP-specific antibodies 

5 or antibody fragments, it is possible to detect NTP through the use of 

iadioimmune assays. A good description of a radioimmune assay may be 
found in Laboratory Techniques and Biochemistry in Molecular Biology, by 
Work, ex al , North Holland Publishing Company, NY (1978), with particular 
reference to the chapter entitled "An Introduction to Radioimmune Assay and 

10 Related Techniques" by Chard, incorporated by reference herein. 

The radioactive isotope can be detected by such means as the use of 
a gamma counter or a scintillation counter or by autoradiography. Isotopes 
which are particularly useful for the purpose of the present invention are: 3 H, 
,25 1, 13l l, M S, ,4 C, and preferably l25 L 

15 it is also possible to label the NTP-specific antibody with a fluorescent 

compound. When the fluorescently labeled antibody is exposed to light of the 
proper wave length, its presence can then be detected due to fluorescence. 
Among the most commonly used fluorescent labelling compounds are 
fluorescein isothiocyanate, rhodamine, phycoeiytherin, phycocyanin, 

20 allophycocyanin, o-phthaldehyde and fluorescamine. 

The NTP-specific antibody can also be detectably labeled using 
fluorescence emitting metals such as l52 Eu, or others of the lanthanide series. 
These metals can be attached to the NTP-specific antibody using such metal 
chelating groups as diethylenetriaminepentaacetic acid (DTPA) or 

25 ethylenediaminetetraacetic acid (EDTA). 

The NTP-specific antibody also can be detectably labeled by coupling 
it to a chemiluminescent compound. The presence of the chemiluminescent- 
tagged NTP-specific antibody is then determined by detecting the presence of 
luminescence that arises during the course of a chemical reaction. Examples 

30 of particularly useful chemiluminescent labeling compounds are luminol, 
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isoluminol, theromatic acridinium ester, imidazole, acridinium salt and oxalate 
ester. 

The NTP-specific antibody may also be labeled with biotin and then 
reacted with avidin. A biotin-labeled DNA fragment will be linked to the 
NTP-btotinylated monoclonal antibody by an avidin bridge. NTP molecules 
are then detected by polymerase chain reaction (PCR) amplification of the 
DNA fragment with specific primers (Sano et al, Science 258: 120-122 
(1992)). 

likewise, a bioluminescent compound .may be used to label the NTP- 
specific antibody of the present invention. Bioluminescence is a type of 
chemiluminescence found in biological systems in which a catalytic protein 
increases the efficiency of the chemiluminescent reaction. The presence of a 
bioluminescent protein is determined by detecting the presence of 
luminescence. Important bioluminescent compounds for purposes of labeling 
are luciferin, luciferase and aequorin. 

Detection of the NTP-specific antibody may be accomplished by a 
scintillation counter, for example, if the detectable label is a radioactive 
gamma emitter, or by a fluorometer, for example, if the label is a fluorescent 
material. In the case of an enzyme labe.1, the detection can be accomplished 
by coloriraetric methods which employ a substrate for the enzyme. Detection 
may also be accomplished by visual comparison of the extent of enzymatic 
reaction of a substrate in comparison with similarly prepared standards. 

The detection of foci of such detectably labeled antibodies is indicative 
of a disease or dysfunctional state as previously described. For the purposes 
of the present invention, the NTP which is detected by this assay may be 
present in a biological sample. Any sample coiitaining an NTP can be used. 
However, one of the benefits of the present diagnostic invention is that 
invasive tissue removal may be avoided. Therefore, preferably, the sample 
is a biological solution such as, for example, cerebrospinal fluid, amniotic 
fluid, blood, serum, urine and the like. However, the invention is not limited 
to assays using only these samples, it being possible for one of ordinary skill 
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in the art to determine suitable conditions which allow the use of other 
samples. 

For example, the three-site monoclonal antibody-based 
immunoradiometric assays (M-IRMA) may be used to measure NTP levels in 
a biological fluid, such as CSF. It is possible to obtain, by spinal tap, on a 
routine basis, CSF from individuals suspected of having AD. Thus, the 
diagnosis of AD can be established by a simple, non-invasive immunoassay 
which reveals NTP levels greatly increased over normal levels. 

In one embodiment, as described above, this examination for AD is 
accomplished by removing samples of biological fluid and incubating such 
samples in the presence of detectably labeled antibodies (or antibody 
fragments). In a preferred embodiment, this technique is accomplished in a 
non-invasive manner through the use of magnetic imaging, fluorography, etc. 

Preferably, the detection of cells which express an NTP may be 
accomplished by in vivo imaging techniques, in which the labeled antibodies 
(or fragments thereof) are provided to a subject, and the presence of the NTP 
is detected without the prior removal of any tissue sample. Such in vivo 
detection procedures have the advantage of being less invasive than other 
detection methods, and are, moreover, capable of detecting the presence of 
NTP in tissue which cannot be easily removed from the patient, such as brain 
tissue. 

Using in vivo imaging techniques, it will be possible to differentiate 
between AD and a brain tumor because NTP will be detected throughout the 
brain in an AD patient, while NTP will be localized in discrete deposits in the 
case of brain tumors. For example, in brains of AD patients, NTP will be 
found in the temporal, parietal and frontal cortices as well as the amygdala 
and hippocampus. Favored cites for astrocytomas include the cerebrum, 
cerebellum, thalamus, optic chiasma, and pons (Harrison's Principles of 
Internal Medicine, Petersdorf et al , Eds. , Tenth Edition, McGraw-Hill Book 
Company, New York, p.2076 (1983)), and glioblastoma multiforme is 
predominantly cerebral in location {Id, at 2075). 
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There are many different in vivo labels and methods of labeling known 
to those of ordinary skill in the art Examples of the types of labels which 
can be used in the present invention include radioactive isotopes and 
paramagnetic isotopes. Those of ordinary skill in the art will know of other 

5 suitable labels for binding to the antibodies used in the invention, or will be 

able to ascertain such, using routine experimentation. Furthermore, the 
binding of these labels to the antibodies can be done using standard techniques 
common to those of ordinary skill in the art. 

An important factor in selecting a radionuclide for in vivo diagnosis is 

10 that the half-life of a radionuclide be long enough so that it is stilt detectable 

at the time of maximum uptake by the target, but short enough so that 
deleterious radiation upon the host is minimized. Ideally, a radionuclide used 
for in vivo imaging will lack a particulate emission, but produce a large 
number of photons in the 140-200 keV range, which maybe readily detected 

15 by conventional gamma cameras. 

For in vivo diagnosis radionuclides may be bound to antibody either 
directly or indirectly by using an intermediary functional group. Intermediary 
functional groups which are often used in binding radioisotopes which exist 
as metallic ions to immunoglobulins are DTPA and EDTA. Typical examples 

20 of ions which can be bound to immunoglobulins are ""Tc, 123 I, lll In, m I, 

"Ru, *Cu t ^Ga, m l "Ga, n As. w Zr, and M1 T1. 

For diagnostic in vivo imaging, the type of detection instrument 
available is a major factor in selecting a given radionuclide. The radionuclide 
chosen must have a type of decay which is detectable for a given type of 

25 instrument. In general, any conventional method for visualizing diagnostic 

imaging can be utilized in accordance with this invention. For example, PET, 
gamma, beta, and MRI detectors can be used to visualize diagnostic 
imagining. 

The antibodies useful in the invention can also be labeled with 
30 paramagnetic isotopes for purposes of in vivo diagnosis. Elements which are 
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particularly useful, as in Magnetic Resonance Imaging (MRI), include 157 Gd, 
^Mn, m Dy, and *Fe. 

The antibodies (or fragments thereof) useful in the present invention 
are also particularly suited for use in in vitro immunoassays to detect the 
presence of an NTP in body tissue, fluids (such as CSF), or cellular extracts. 
In such immunoassays, the antibodies (or antibody fragments) may be utilized 
in liquid phase or, preferably, bound to a solid-phase carrier, as described 
above. 

Those of ordinary skill in the art will know of other suitable labels 
which may be employed in accordance with the present invention. The 
binding of these labels to antibodies or fragments thereof can be accomplished 
using standard techniques commonly known to those of ordinary skill in the 
art. Typical techniques are described by Kennedy, et al (Clin, Chim. Acta 
70:1-31 (1976)) and Schurs, et al. (Clin. Chim. Acta 57:1-40 (1977)). 
Coupling techniques mentioned in the latter are the glutaraldehyde method, the 
periodate method, the dimaleirnide method, the m-maleimidobenzyl-N- 
hydroxy-succinimide ester method, all of which methods are incorporated by 
reference herein. 

In situ detection may be accomplished by removing a histological 
specimen from a patient, and providing the combination of labeled antibodies 
of the present invention to such a specimen. The antibody (or fragment) is 
preferably provided by applying or by overlaying the labeled antibody (or 
fragment) to a biological sample. Through the use of such a procedure, it is 
possible to determine not only the presence of an NTP, but also the 
distribution of an NTP on the examined tissue. Using the present invention, 
those of ordinary skill will readily perceive that any of a wide variety of 
histological methods (such as staining procedures) can be modified in order 
to achieve such in situ detection. 

The binding molecules of the present invention may be adapted for 
utilization in an immunometric assay, also known as a "two-site" or 
"sandwich" assay. In a typical immunometric assay, a quantity of unlabeled 
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antibody (or fragment of antibody) is bound to a solid support that is insoluble 
in the fluid being tested (i.e., CSF) and a quantity of detectably labeled 
soluble antibody is added to permit detection and/or quantitation of the ternary 
complex formed between solid-phase antibody, antigen, and labeled antibody. 

5 Typical, and preferred, immunometric assays include "forward' assays 

in which the antibody bound to the solid phase is first contacted with the 
sample being tested to extract the antigen from the sample by formation of a 
binary solid phase antibody-antigen complex. After a suitable incubation 
period, the solid support is washed to remove the residue of the fluid sample, 

10 including unreacted antigen, if any, and then contacted with the solution 

containing an unknown quantity of labeled antibody (which functions as a 
"reporter molecule"). After a second incubation period to permit the labeled 
antibody to complex with the antigen bound to the solid support through the 
unlabeled antibody, the solid support is washed a second time to remove the 

15 unreacted labeled antibody. This type of forward sandwich assay may be a 

simple "yes/no" assay to determine whether antigen is present or may be made 
quantitative by comparing the measure of labeled antibody with that obtained 
for a standard sample containing known quantities of antigen. Such "two-site" 
or "sandwich" assays are described by Wide at pages 199-206 of 

20 Radioimmwe Assay Method, edited by Kirkham and Hunter, E. & S. 

Livingstone, Edinburgh, 1970. 

In another type of "sandwich" assay, which may also be useful with 
the antigens of the present invention, the so-called "simultaneous" and 
"reverse" assays are used. A simultaneous assay involves a single incubation 

25 step as the antibody bound to the solid support and labeled antibody are both 

added to the sample being tested at the same time. After the incubation is 
completed, the solid support is washed to remove the residue of fluid sample 
and uncomplexed labeled antibody. The presence of labeled antibody 
associated with the solid support is then determined as it would be in a 

30 conventional "forward" sandwich assay. 
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In the "reverse" assay, stepwise addition first of a solution of labeled 
antibody to the fluid sample followed by the addition of unlabeled antibody 
bound to a solid support after a suitable incubation period is utilized. After 
a second incubation, the solid phase is washed in conventional fashion to free 
5 it of the residue of the sample being tested and the solution of unreacted 

labeled antibody. The determination of labeled antibody associated with a 
solid support is then determined as in the "simultaneous" and "forward" 
assays. 

The above-described in vitro or in vivo detection methods may be used 

10 in the detection and diagnosis of AD without the necessity of removing tissue. 

Such detection methods may be used to assist in the determination of the stage 
of neurological deterioration in AD by evaluating and comparing the 
concentration of an NTP in the biological sample. 

As used herein, an effective amount of a diagnostic reagent (such as 

15 an antibody or antibody fragment) is one capable of achieving the desired 

diagnostic discrimination and will vary depending on such factors as age, 
condition, sex, the extent of disease of the subject, counterindications, if any, 
and other variables to be adjusted by the physician. The amount of such 
materials which are typically used in a diagnostic test are generally between 

20 0.1 to 5 mg, and preferably between 0.1 to 0.5 rag. 

The assay of the present invention is also ideally suited for the 
preparation of a kit. Such a kit may comprise a carrier means being 
compartmentalized to receive in close confinement therewith one or more 
container means such as vials, tubes and the like, each of said container means 

25 comprising the separate elements of the immunoassay. 

For example, there may be a container means containing a first 
antibody immobilized on a solid phase support, and a further container means 
containing a second detectably labeled antibody in solution. Further container 
means may contain standard solutions comprising serial dilutions of the NTP 

30 to be detected. The standard solutions of an NTP may be used to prepare a 

standard curve with the concentration of NTP plotted on the abscissa and the 
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detection signal on the ordinate. The results obtained from a sample 
containing an NT? may be interpolated from such a plot to give the 
concentration of the NTP. 

IV. Isolation ofNTP 

The NTP proteins or fragments of this invention may be obtained by 
expression from recombinant DNA as described above. Alternatively, an 
NTP may be purified from biological material. 

For purposes of the present invention, one method of purification 
which is illustrative, without being limiting, consists of the following steps. 

A first step in the purification of an NTP includes extraction of the 
NTP fraction from a biological sample, such as brain tissue or CSF, in 
buffers, with or without solubilizing agents such as urea, formic acid, 
detergent, or thiocyanate. 

A second step includes subjecting the solubilized material to ion- 
exchange chromatography on Mono-Q or Mono-S columns (Pharmacia LKB 
Biotechnology, Inc; Piscataway, NJ). Similarly, the solubilized material may 
be separated by any other process wherein molecules can be separated 
according to charge density, charge distribution and molecular size, for 
example. Elution of the NTP from the ion-exchange resin are monitored by 
an immunoassay, such as M-IRMA, on each fraction. Iramunoreactive peaks 
would are then dialyzed, lyophilized, and subjected to molecular sieve, or gel 
chromatography. 

Molecular sieve or gel chromatography is a type of partition 
chromatography in which separation is based on molecular size. Dextran, 
polyacrylamide, and agarose gels are commonly used for this type of 
separation. One useful gel for the present invention is Sepharose 12 
(Pharmacia LKB Biotechnology, Inc.). However, other methods, known to 
those of skill in the art may be used to effectively separate molecules based 
on size. 
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A fourth step in a purification protocol for an NTP includes analyzing 
the immunoreactive peaks by sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis (SDS-PAGE), a further gel chromatographic purification step, 
and staining, such as, for example, silver staining. 

A fifth step in a purification method includes subjecting the NTP 
obtained after SDS-PAGE to affinity chromatography, or any other procedure 
based upon affinity between a substance to be isolated and a molecule to 
which it can specifically bind. For further purification of an NTP, affinity 
chromatography on Sepharose conjugated to anti-NTP mAbs (such as Th9, or 
specific mABs generated against substantially pure NTP) can be used. 
Alternative methods, such as reverse-phase HPLC, or any other method 
characterized by rapid separation with good peak resolution are useful. 

Another method to purify an NTP is to use concentrated CSF obtained 
from patients with AD. For this procedure, 30-40 milliliters are concentrated 
by lyophilization or Amicon filtration or the like, and subjected to two 
dimensional gel electrophoresis. Proteins are separated in one direction by 
charge in a pH gradient and then, subjected to molecular sieve 
chromatography in the other direction by polyacrylamide gel electrophoresis. 
NTP-immunoreactive proteins are identified as spots by the Th monoclonal 
antibodies (for example, Th 9) using Western blot anaylsis. The gel is cut 
and NTP proteins are eluted from the gel. NTP purified in this manner can 
be sequenced or used to make new monoclonal antibodies. 

It will be appreciated that other purification steps may be substituted 
for the preferred method described above, those of skill in the art will be 
able to devise alternate purification schemes without undue experimentation. 

V. Gene Therapy Using Antisense Oligonucleotides and Ribozymes 

Antisense oligonucleotides have been described as naturally occurring 
biological inhibitors of gene expression in both prokaryotes (Mizuno et aL, 
Proc. Nazi Acad, ScL USA 81: 1966-1970 (1984)) and eukaryotes (Heywood, 
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Nucleic Acids Res. 7*6771-6772 (1986)). and these sequences presumably 
function by hybridizing to complementary mRNA sequences, resulting in 
hybridization arrest of translation (Paterson, et at., Proc. Natl. Acad. Sa. 

USA, 74:4370-4374 (1987)). 

Antisense oligonucleotides are short synthetic DNA or RNA nucleotide 
molecules formulated to be complementary to a specific gene or RNA 
message. Through the binding of these oligomers to a target DNA or mRNA 
sequence, transcription or translation of the gene can be selectively blocked 
and the disease process generated by that gene can be halted (see. for 
example. Jack Cohen. Oligodeaxynucleotides. Antisense Inhibitors of Gene 
Expression. CRC Press (1989)). The cytoplasmic location of mRNA provides 
a target considered to be readily accessible to antisense oligodeoxynucleotides 
entering the cell; hence much of the work in the field has focused on RNA as 
a target. Currently, the use of antisense oligodeoxynucleotides provides a 
useful tool for exploring regulation of gene expression in vitro and in tissue 
culture (Rothenberg. et al. J. Natl Cancer Inst. Si:1539-1544 (1989)). 

Antisense therapy is the administration of exogenous oligonucleotides 
which bind to a target polynucleotide located within the cells. For example, 
antisense oligonucleotides may be administered systemically for anticancer 
therapy (Smith. International Application Publication No. WO 90/09180). As 
described herein. NTP-related proteins are produced by neuroectodermal 
tumor cells, malignant astrocytoma cells, glioblastoma cells, and in relatively 
high concentrations (i.e. relative to controls) in brain tissue of AD patients. 
Thus NTP antisense oligonucleotides of the present invention may be active 
in treatment against AD. as well as r*uroectodermal tumors, malignant 
astrocytomas, and glioblastomas. 

The NTP antisense oligonucleotides of the present invention include 
derivatives such as S-oligonucleotides (phosphorothioate derivatives or S- 
oligos. see. Jack Cohen, supra). S-oligos (nucleoside phosphorothioates) are 
isoelectronic analogs of an oligonucleotide (O-oligo) in which a nonbndging 
oxygen atom of the phosphate group is replaced by a sulfur atom. The S- 
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oligos of the present invention may be prepared by treatment of the 
corresponding O-oligos with 3tf-l t 2-benzoditMol-3-one-l,l-dioxide which is 
a sulfur transfer reagent. See Iyer et al., /. Org. Chem. 55:4693-4698 
(1990); and Iyer et at.. J. Am. Chem. Soc. /72:1253-1254 (1990), the 
disclosures of which are fully incorporated by reference herein. 

As described herein, sequence anaylsis of an NTP cDNA clone shows 
that NTP contains sequences which are nonhomologous to PTP DNA 
sequences (see Figure 9). Thus, the NTP antisense oligonucleotides of the 
present invention may be RNA or DNA which is complementary to and stably 
hybridizes with such sequences which are specific for an NTP. Use of an 
oligonucleotide complementary to this region allows for the selective 
hybridization to NTP mRNA and not to mRNA specifying PTP. Preferably, 
the NTP antisense oligonucleotides of the present invention are a 15 to 30-mer 
fragment of the antisense DNA molecule coding for the nonhomologous 
sequences of the AD 3-4 cDNA, such as: 

1. 5 '-CCGATTCCAACAGACCATCAT-3 ' [SEQ ID NO: 1]; 

2. 5'-CCAACAGACCATCATTCCACC-3' [SEQ ID NO: 2]; and 

3. 5'-CCAAACCGATTCCAACAGACC-3' [SEQ ID NO: 3], 
Preferred antisense oligonucleotides bind to the 5 '-end of the AD 10-7 

mRNA. Such antisense oligonucleotides may be used to down regulate or 
inhibit expression of the NTP gene. Examples of such antisense 
oligonucleotides (30-mers) include: 

1. 5 '-CCTGGGCAACAAGAGCGAAAACTCCATCTC-3 ' [SEQ 

ID NO: 4]; 

2. 5 '-ATCGCTTGAACCCGGGAGGCGGAGGTTGCG-3 ' [SEQ 
ID NO: 5]; and 

3. 5'-GGGGAGGCTGAGGCAGGAGAATCGCTTGAA-3'[SEQ 
ID NO: 6]. 

Included as well in the present invention are pharmaceutical 
compositions comprising an effective amount of at least one of the NTP 
antisense oligonucleotides of the invention in combination with a pharma- 
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ccutically acceptable carrier. In one embodiment, a single NTP antisense 
oligonucleotide is utilized. In another embodiment, two NTP antisense 
oligonucleotides are utilized which arc complementary to adjacent regions of 
the NTP genome. Administration of two NTP antisense oligonucleotides 
5 which are complementary to adjacent regions of the genome or corresponding 

mRNA may allow for more efficient inhibition of NTP genomic transcription 
or mRNA translation, resulting in more effective inhibition of NTP 
production. 

Preferably, the NTP antisense oligonucleotide is coadministered with 
10 an agent which enhances the uptake of the antisense molecule by the cells. 

For example, the NTP antisense oligonucleotide may be combined with a 
lipophilic cationic compound which may be in the form of liposomes. The 
use of liposomes to introduce nucleotides into cells is taught, for example, in 
U.S. Patent Nos. 4,897,355 and 4,394,448, the disclosures of which are 
15 incorporated by reference in their entirety. See also U.S. Patent Nos. 

4,235,871, 4,231,877, 4,224,179, 4,753,788, 4,673,567, 4,247,411, 
4,814,270 for general methods of preparing liposomes comprising biological 
materials. 

Alternatively, the NTP antisense oligonucleotide may be combined with 
20 a lipophilic carrier such as any one of a number of sterols including 

cholesterol, chelate and deoxycholic acid. A preferred sterol is cholesterol. 

In addition, the NTP antisense oligonucleotide may be conjugated to 
a peptide that is ingested by cells. Examples of useful peptides include 
peptide hormones, antigens or antibodies, and peptide toxins. By choosing a 
25 peptide that is selectively taken up by the neoplastic cells, specific delivery of 

the antisense agent may be effected. The NTP antisense oligonucleotide may 
be covalently bound via the 5 'OH group by formation of an activated 
aminoalkyl derivative. The peptide of choice may then be covalently attached 
to the activated NTP antisense oligonucleotide via an amino and sulfhydryl 
30 reactive hetero bifimctional reagent. The latter is bound to a cysteine residue 

present in the peptide. Upon exposure of cells to the NTP antisense 
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oligonucleotide bound to the peptide, the peptidyl antisense agent is 
endocytosed and the NTP antisense oligonucleotide binds to the target NTP 
mRNA to inhibit translation (Haralambid et al t WO 8903849; Lebleu et al % 
EP 0263740). 

The NTP antisense oligonucleotides and the pharmaceutical composi- 
tions of the present invention may be administered by any means that achieve 
their intended purpose. For example, administration may be by parenteral, 
subcutaneous, intravenous, intramuscular, intra-peritoneal, or transdermal 
routes. The dosage administered will be dependent upon the age, health, and 
weight of the recipient, kind of concurrent treatment, if any, frequency of 
treatment, and the nature of the effect desired. 

Compositions within the scope of this invention include all composi- 
tions wherein the NTP antisense oligonucleotide is contained in an amount 
effective to achieve inhibition of proliferation and/or stimulate differentiation 
of the subject cancer cells, or alleviate AD. While individual needs vary, 
determination of optimal ranges of effective amounts of each component is 
with the skill of the art. Typically, the NTP antisense oligonucleotide may be 
administered to mammals, e.g. humans, at a dose of 0.005 to 1 mg/kg/day, 
or an equivalent amount of the pharmaceutical^ acceptable salt thereof, per 
day of the body weight of the mammal being treated. 

Alternatively, antisense oligonucleotides can be prepared which are 
designed to interfere with transcription of the NTP gene by binding 
transcribed regions of duplex DNA (including introns, exons, or both) and 
forming triple helices (Froehler et al , WO 91/06626; Toole, WO 92/10590). 
Preferred oligonucleotides for triple helix formation are oligonucleotides 
which have inverted polarities for at least two regions of the oligonucleotide 
(Id.). Such oligonucleotides comprise tandem sequences of opposite polarity 
such as 3'— 5'-I^5'— 3', or 5'— 3'-L-3'— 5', wherein L represents a 0-10 
base oligonucleotide linkage between oligonucleotides. The inverted polarity 
form stabilizes single-stranded oligonucleotides to exonuclease degradation 
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(Frochler et a!., supra). Preferred triple helix-forming oligonucleotides are 

based upon SEQ ID NOs 1-3: 

1. 3'-TACTACCAGACAACCTTAGCC-5'-L- 
5'-CCGATTCCAACAGACCATCAT-3' [SEQ ID NO: 7]; 

2. 5'-CCGATTCCAACAGACCATCAT-3'-Lr 
3'-TACTACCAGACAACCTTAGCC-5' [SEQ ID NO: 8]; 

3. 3'-CCACCTTACTACCAGACAACC-5'-U 
5'-CCAACAGACCATCATTCCACC-3' [SEQ ID NO: 9J; 

4. 5'-CCAACAGACCATCATTCCACC-3'-I^ 
3'-CCACCTTACTACCAGACAACC-5' [SEQ ID NO: 10]; 

5. 3'-CCAGACAACCTTAGCCAAACC-5'-L- 
5'-CCAAACCGATTCCAACAGACC-3' [SEQ ID NO: 11]; 

and 

6. 5'-CCAAACCGATTCCAACAGACC-3'-Lr 
3*-CCAGACAACCTTAGCCAAACC-5' [SEQ ID NO: 12]. 

Thus, triple helix-forming oligonucleotides 1 and 2 are represented as 3'[SEQ 
ID NO: 1]5'-L-5'[SEQ ID NO: 1]3* and 5'[SEQ ID NO: 1]3'-L-3'[SEQ ID 
NO: 1]5', respectively. Triple helix-forming oligonucleotides 3 and 4 are 
represented as 3'[SEQ ID NO: 2]5'-L-5'[SEQ ID NO: 2J3' and 5'[SEQ ID 
NO: 2]3'-L-3'[SEQ ID NO: 2]5', respectively. Triple helix-forming 
oligonucleotides 5 and 6 are represented as 3'[SEQ ID NO: 3]5'-L-5'[SEQ ID 
NO: 3]3' and 5'[SEQ ID NO: 3]3'-W[SEQ ID NO: 3]5', respectively. Of 
course, similar triple helix-forming oligonucleotide may be prepared with SEQ 
ID NOs. 4-6, or fragments thereof. 

In therapeutic application, the triple helix-forming oligonucleotides can 
be formulated in pharmaceutical preparations for a variety of modes of 
administration, including systemic or localized administration, as described 
above. 

The antisense oligonucleotides of the present invention may be 
prepared according to any of the methods that are well known to those of 
ordinary skill in the art, as described above. 
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Ribozymes provide an alternative method to inhibit mRNA function. 
Ribozymes may be RNA enzymes, self-splicing RNAs, and self-cleaving 
RNAs (Cech etaU Journal of Biological Chemistry 267:17479-17482 (1992)). 
It is possible to construct de novo ribozymes which have an endonuclease 

5 activity directed in trans to a certain target sequence. Since these ribozymes 

can act on various sequences, ribozymes can be designed for virtually any 
RNA substrate. Thus, ribozymes are very flexible tools for inhibiting the 
expression of specific genes and provide an alternative to antisense constructs. 
A ribozyme against chloramphenicol acetyltransferase mRNA has been 

10 successfully constructed (Haseloff et al, Nature 354:585-591 (1988); 

Uhlenbeck et al. Nature 525:596-600 (1987)). The ribozyme contains three 
structural domains: 1) a highly conserved region of nucleotides which flank 
the cleavage site in the 5' direction; 2) the highly conserved sequences 
contained in naturally occurring cleavage domains of ribozymes, forming a 

15 base-paired stem; and 3) the regions which flank the cleavage site on both 

sides and ensure the exact arrangement of the ribozyme in relation to the 
cleavage site and the cohesion of the substrate and enzyme. RNA enzymes 
constructed according to this model have already proved suitable in vitro for 
the specific cleaving of RNA sequences (Haseloff et al , supra). 

20 Alternatively, hairpin ribozymes may be used in which the active site 

is derived from the minus strand of the satellite RNA of tobacco ring spot 
virus (Hampel etal. Biochemistry 28:4929-4933 (1989)). Recently, a hairpin 
ribozyme was designed which cleaves human immunodeficiency virus type 1 
RNA (Ojwang etal 9 Proc. Natl Acad. Sci. USA 89:10802-10806 (1992)). 

25 Other self-cleaving RNA activities are associated with hepatitis delta virus 

(Kuo et al, J. Virol 62:4429^444 (1988)). 

As discussed above, preferred targets for NTP ribozymes are the 
nucleotide sequences which are not homologous with FTP sequences. 
Preferably, the NTP ribozyme molecule of the present invention is designed 

30 based upon the chloramphenicol acetyltransferase ribozyme or hairpin 

ribozymes, described above. Alternatively, NTP ribozyme molecules are 
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dcsigned as described by Eckstein et al (International Publication No. WO 
92/07065) who disclose catalytically active ribozyme constructions which have 
increased stability against chemical and enzymatic degradation, and thus are 
useful as therapeutic agents. 

In an alternative approach, an external guide sequence (EGS) can be 
constructed for directing the endogenous ribozyme, RNase P, to intracellular 
NTP mRNA, which is subsequently cleaved by the cellular riboTyme (Altman 
et al f U.S. Patent No. 5,168,053). Preferably, the NTP EGS comprises a 
ten to fifteen nucleotide sequence complementary to an NTP mRNA and a 3'- 
NCCA nucleotide sequence, wherein N is preferably a purine (Id.). After 
NTP EGS molecules are delivered to cells, as described below, the molecules 
bind to the targeted NTP mRNA species by forming base pairs between the 
NTP mRNA and the complementary NTP EGS sequences, thus promoting 
cleavage of NTP mRNA by RNase P at the nucleotide at the 5 'side of the 
base-paired region (Id.). 

Included as well in the present invention are pharmaceutical 
compositions comprising an effective amount of at least one NTP ribozyme 
or NTP EGS of the invention in combination with a pharmaceutical^ 
acceptable carrier. Preferably, the NTP ribozyme or NTP EGS is 
coadministered with an agent which enhances the uptake of the ribozyme or 
NTP EGS molecule by the cells. For example, the NTP ribozyme or NTP 
EGS may be combined with a lipophilic cationic compound which may be in 
the fonn of liposomes, as described above. Alternatively, the NTP ribozyme 
or NTP EGS may be combined with a lipophilic carrier such as any one of a 
number of sterols including cholesterol, cholate and deoxycholic acid. A 
preferred sterol is cholesterol. 

The NTP ribozyme or NTP EGS, and the pharmaceutical compositions 
of the present invention may be administered by any means that achieve their 
intended purpose. For example, administration may be by parenteral, 
subcutaneous, intravenous, intramuscular, intra-peritoneal, or transdermal 
routes. The dosage administered will be dependent upon the age, health, and 
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weight of the recipient, kind of concurrent treatment, if any, frequency of 
treatment* and the nature of the effect desired. For example, as much as 700 
milligrams of antisense oligodeoxynucleotide has been administered 
intravenously to a patient over a course of 10 days (i.e., 0.05 mg/kg/hour) 
5 without signs of toxicity (Sterling, "Systemic Antisense Treatment Reported, " 

Genetic Engineering News 12(12):l f 28 (1992)). 

Compositions within the scope of this invention include all composi- 
tions wherein the NTP ribozyme or NTP EGS is contained in an amount 
which is effective to achieve inhibition of proliferation and/or stimulate 

10 differentiation of the subject cancer cells, or alleviate AD. While individual 

needs vary, determination of optimal ranges of effective amounts of each 
component is with the skill of the art. 

In addition to administering the NTP antisense oligonucleotides, 
ribozymes, or NTP EGS as a raw chemical in solution, the therapeutic 

15 molecules may be administered as part of a pharmaceutical preparation 

containing suitable pharmaceutically acceptable carriers comprising excipients 
and auxiliaries which facilitate processing of the NTP antisense 
oligonucleotide, ribozyme, or NTP EGS into preparations which can be used 
pharmaceutically. 

20 Suitable formulations for parenteral administration include aqueous 

solutions of the NTP antisense oligonucleotides, ribozymes, NTP EGS in 
water-soluble form, for example, water-soluble salts. In addition, suspensions 
of the active compounds as appropriate oily injection suspensions may be 
administered. Suitable lipophilic solvents or vehicles include fatty oils, for 

25 example, sesame oil, or synthetic fatty acid esters, for example, ethyl oleate 

or triglycerides. Aqueous injection suspensions may contain substances which 
increase the viscosity of the suspension include, for example, sodium 
carboxymethyl cellulose, sorbitol, and/or dextran. Optionally, the suspension 
may also contain stabilizers. 

30 Alternatively, NTP antisense RNA molecules, NTP ribozymes, and 

NTT EGS can be coded by DNA constructs which are administered in the 
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fonn of virions, which are preferably incapable of replicating in vivo (see, for 
example, Taylor, WO 92/06693). For example, such DNA constructs may 
be administered using herpes-based viruses (Gage et aL, U.S. Patent No. 
5,082,670). Alternatively, NTP antisense RNA sequences, NTP ribozymcs, 
and NTP EGS can be coded by RNA constructs which are administered in the 
form of virions, such as retroviruses. The preparation of retroviral vectors 
is well known in the art (see, for example. Brown et aL, "Retroviral 
Vectors," in DNA Cloning: A Practical Approach, Volume 3, IRL Press, 
Washington, B.C. (1987)). 

Specificity for gene expression in the central nervous system can be 
conferred by using appropriate cell-specific regulatory sequences, such as cell- 
specific enhancers and promoters. For example, such sequences include the 
sequences that regulate the oligodendroglial-specific expression of JC virus, 
glial-specific expression of the proteolipid protein, and the glial fibrillary 
acidic protein genes (Gage et aL, supra). Since protein phosphorylation is 
critical for neuronal regulation (Kennedy, "Second Messengers and Neuronal 
Function." in An Introduction to Molecular Neurobiology, Hall, Ed., Sinauer 
Associates, Inc. (1992)), protein kinase promoter sequences can be used to 
achieve sufficient levels of NTP gene expression. 

Thus, gene therapy can be used to alleviate AD by inhibiting the 
inappropriate expression of a particular form of NTP. Moreover, gene 
therapy can be used to alleviate AD by providing the appropriate expression 
level of a particular form of NTP. In this case, particular NTP nucleic acid 
sequences may be coded by DNA or RNA constructs which are administered 
in the form of viruses, as described above. Alternatively, "donor cells" may 
be modified in vitro using viral or retroviral vectors containing NTP 
sequences, or using other well known techniques of introducing foreign DNA 
into cells (see, for example, Sambrook et aL, supra). Such donor cells 
include fibroblast cells, neuronal cells, glial cells, and connective tissue cells 
(Gage et aL, supra). Following genetic manipulation, the donor cells are 
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grafted into the central nervous system and thus, the genetically-modified cells 

provide die therapeutic form of NTP (Id). 

Moreover, such virions may be introduced into the blood stream for 

delivery to the brain. This is accomplished through the osmotic disnxption of 
5 the blood brain barrier prior to administration of the virions (see, for example, 

Neuwelt, United States Patent No. 4,866,042). The blood brain barrier may 

be disrupted by administration of a pharmaceutical^ effective, nontoxic 

hypertonic solution, such as mannitol, arabinose, or glycerol (Id.). 

The following clones in E. coli were deposited according to the 
10 Budapest Treaty with the American Type Culture Collection (12301 Parklawn 

Drive, Rockville, Maiyland, 20852): G2-2 PstI-DH5 (ATCC No. 69257); 

G5d-PstI-DH5 (ATCC No. 69258); l-9a-LX-l blue (ATCC No. 69259); 

AD3^-DH1 (ATCC No. 69260); HB4-XL4>lue (ATCC No. 69261); AD10-7- 

DH1 (ATCC No. 69262); AD2-2-DH1- (ATCC No. 69263); G5d-lPstI- 
15 EcoRI-DH5 (ATCC No. 69264); and G2-2PstI-EcoRI-DH5 (ATCC No. 

69265). 

Having now generally described the invention, the same will be more 
readily understood through reference to the following Examples which are 
provided by way of illustration, and are not intended to be limiting of the 
20 present invention, unless specified. 

Example 1 

Expression of NTP Immunoreactivify in Cell Lines 

Seven cell lines of central nervous system origin were identified that 
express thread protein immunoreacUvity using the Th9 monoclonal antibody 
25 which was generated to the pancreatic form of the protein (Gross et al. J. 

din. Invest. 76:2115-2126 (1985)), but cross-reacts with thread proteins 
present in brain tissue and cerebrospinal fluid (Ozturk et of., Proc. Natl 
Acad. Sci. USA 56:419-423 (1989); de la Monte et. al, J. Clin. Invest. 
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56:1004-1013 (1990); de la Monte et. al., J. Neurol. Sti. 113: 152-164 (1992); 
de la Monte et al., Ann. Neurol 52:733-742 (1992)). Among them were the 
following: two primitive neuroectodermal tumor (PNET) cell lines designated 
PNETl and PNET2; three glioblastoma cell lines Hgl 16, Hgl 17, and C6; 

5 the A172 glial cell line; and the SH-Sy5y neuroblastoma cell line. The 

glioblastoma cell lines and the A172 cells were obtained from the American 
Type Culture Collection (ATCC). SH-Sy5y cells were obtained from Dr. 
Biedler at Sloan-Kettering Memorial Hospital. The PNET cell lines have been 
described previously (The et al, Nature genetics 5:62-66 (1993)), and were 

10 obtained from Dr. Rene' Bernards at the MGH Cancer Center. All cell lines 

were maintained in Earl's Modified Eagle Medium supplemented with 10% 
fetal calf serum, and without antibiotics. 

To examine the cells for thread protein and other immunoreactivities, 
the cultures were harvested in phosphate buffered saline (PBS) (137 mM 

15 NaCl, 2.7 mM KCI, 4.3 mM Na,HP04, 1 .4 mM KH 2 PO«. pH 7.3) containing 

2 mM EDTA, and cytospin preparations were made using 10 5 cells per slide. 
The cytospin preparations were fixed immediately in 100% methanol (-20 8 C), 
air-dried, and then stored at -80°C until used. Prior to immunostaining. the 
slides were equilibrated to room temperature and hydrated in PBS. 

20 Nonspecific antibody binding was blocked with 3 % nonimmune horse serum. 

Replicate cytospin preparations from the same cultures were incubated 
overnight at 4 e C with 5 or 10 /ig/ml of primary antibody. Immunoreactivity 
was revealed by the avidin-biotin horseradish peroxidase method using the 
Vectastain Elite kit (Vector Laboratories, Buriingame, CA) according to the 

25 manufacturer's protocol, and with 3-3' diaminobenzidine (0.5 mg/ml plus 

0.03% hydrogen peroxide) as the chromogen. The cells then were 
counterstained with hematoxylin, dehydrated in graded alcohol solutions, 
cleared in xylenes, and preserved under coverglass with Permount (Fisher 
Scientific). 

30 Cytospin preparations of each cell line were immunostained with the 

thread protein monoclonal antibodies Th9, Th7, ThlO, Th29, Th34, TH46, 
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Th67, and Th90. In addition, replicate slides were immunostained with 
positive (neurofilament, glial fibrillary acidic protein (GFAP), and vimentin) 
and negative (desmin, Hepatitis B surface antigen-5C3) control monoclonal 
antibodies. Except for 5C3 which was generated in the inventor's laboratory 
(Fujita et al. Gastroenterology 92:1357-1363 (1986)), the control antibodies 
were purchased (Boehringer-Mannheim). All serological reagents were 
diluted in PBS containing 1% bovine serum albumin (BSA), and all 
incubations except the one with primary antibody were carried out at room 
temperature in humidified chambers. The slides were washed in 3 changes of 

PBS between each step. 

Both PNET1 and PNET2 cells expressed high and middle molecular 
weight neurofilament proteins and little or no glial fibrillary acidic protein or 
vimentin. The PNET1, PNET2, and SH-Sy5y cells expressed GAP-43, an 
abundant calmodulin-binding phosphoprotein that is highly expressed in 
immature neurons and in neurons undergoing regenerative cell growth 
(Benowitzrta/., /. Neurosci. 3:2153-2163 (1983); DeGraantfa/.. Neurosci. 
Lett. 62:235-241 (1985); Kalil et al., J. Neurosci. 6:2563-2570 (1986)). The 
A172 and C6 cells expressed GFAP and vimentin. However, A172 also 
exhibited neurofilament inununoreactivity, raising doubt about its purely glial 
cell nature. None of the cell lines manifested inununoreactivity with 
monoclonal antibodies to desmin or to Hepatitis B surface antigen. As a 
negative control cell line, the Huh7 hepatocellular carcinoma cell line was 
similarly immunostained, and found not to exhibit any immunoreactivity with 
the above antibodies. However, the Huh cells were immunoreactive with 
monoclonal antibodies to the insulin receptor substrate protein, IRS-1 (data not 
shown) which was used as a positive control for this cell line (Sasaki et al, 
J. Biol. Cnem. 265:1-4 (1993)). 

Using the Th9 monoclonal antibody, thread protein immunoreactivity 
was detected in primary PNET (A), primary glioblastoma (F). PNET1 (B), 
and C6 cells (G), but not in hepatocellular carcinoma cell lines (Figures 1A- 
1J). In addition. Th9 immunoreactivity was detected in histological sections 
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from 8 of the 9 primary human CNS PNETs. and from all 5 of the primary 
human glioblastomas studied (Figures 1A-1J). Although all 5 cell lines 
exhibited intense immunoreactivity with the Th9 monoclonal antibody, they 
differed with respect to immunoreactivity for other Th monoclonal antibodies. 

5 The immunostaining reaction generated with the ThlO (C,H), Th7 (D,I), or 

Th46 monoclonal antibodies was either low-level (C,D) or absent (H,I,E,J) 
in PNET1 (C-E) and C6 (H-J). PNET2 ceUs exhibited only low levels of 
immunoreactivity with Th7 and Th29, and they manifested no immunostaining 
with the other Th monoclonal antibodies. A172, C6, and SH-Sy5y cells 

10 displayed little or no immunoreactivity with Th monoclonal antibodies other 

than Th9. Huh7 cells exhibited no immunoreactivity with any of the thread 
protein monoclonal antibodies employed, whereas human pancreatic tissue was 
immunoreactive with all of the Th antibodies, which had been generated 
against the purified pancreatic form of thread protein (Gross et al, J. Clin. 

15 Invest. 76:2115-2126 (1985)). 



Example 2 

Analysis of Thread Proteins by Monoclonal Antibody-Based 
Immunoradiometric Assay (M-IRMA) 



Cultured cells were washed in PBS and recovered in PBS containing 
20 2 mM EDTA. The cells were pelleted by centrifugation at 1000 x g for 15 

min, and then resuspended in lysis buffer containing 50 mM Tris-HCl (pH 
7.5), 1% Triton X-100, 2 mM EGTA, 10 mM EDTA, 100 mM NaF, 1 mM 
Na4P 2 0 7 , 2 mM Na 3 V0 4 , 100 M g/ml phenylmethylsulfonyl fluoride, 1 /xg/ml 
aprotinin, 1 /xg/ml pepstatin A, and 1 jig/ml leupeptin. The supernatant 
25 fractions obtained by centrifugation of the lysates at 14,000 x g for 10 min 

were used for the Western blot analysis, immunoprecipitation studies, and M- 
IRMA. Protein concentration was determined by the Lowry colorimetric 
assay. The samples were stored at -40°C. 
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M-IRMA is a highly sensitive two- or three-site forward sandwich 
assay which permits quantitation of picomolar NTP in cell ly sates, tissue 
culture medium, tissue homogenates t and body fluids (Ozturk et al , Proc. 
Natl. Acad. ScL USA 55:419-423 (1989); de la Monte et. al, J. Clin. Invest. 
56:1004-1013 (1990); de la Monte et. al., /. Neurol. ScL 773:152-164 (1992); 
de la Monte et at., Ann. Neurol 32:733-742 (1992); Gross et al., J. Clin. 
Invest. 76:2115-2126 (1985)). In addition, when combined with SDS-PAGE, 
M-IRMA can be used to determine molecular size of thread proteins and 
related species (Ozturk etal, Proc. Natl Acad. ScL USA 56:419-423 (1989); 
de la Monte et. al, J. Clin. Invest. 56:1004-1013 (1990); de la Monte et. al, 
J. Neurol ScL 773:152-164 (1992); de la Monte et al, Ann. Neurol 32:733- 
742 (1992)). M-IRMA involves capturing the immunoreactive thread proteins 
present in biological samples using monoclonal antibodies Th7 and ThlO 
affixed to a solid-phase matrix, and then detecting the captured antigen with 
a third radiolabeled tracer monoclonal antibody (Th9) to the same protein. 
Briefly, 1/4" polystyrene beads (Precision Ball, Inc) were coated with one or 
two monoclonal antibodies to thread proteins (usually Th7 + ThlO). Cell 
lysates or supernatant fractions of tissue homogenates (Ozturk et al, Proc. 
Natl Acad. ScL USA 56:419-423 (1989); de la Monte et. al, J. Clin. Invest. 
56:1004-1013 (1990); de la Monte et. al, J. Neurol ScL 773:152-164 (1992); 
de la Monte et al, Ann. Neurol 32:733-742 (1992)) were incubated over 
night with the coated beads to capture thread proteins present in the samples. 
The beads were washed 5x in PBS, and then incubated with ,25 -I labeled Th9 
as a tracer to detect the captured thread proteins. The concentration of thread 
protein in the lysate or tissue homogenate was determined from a standard 
curve generated with known quantities of purified thread protein. This highly 
sensitive assay can detect as little as 10 pmol of thread protein in solution. 
To assay for thread proteins fractionated by SDS-PAGE, the wet gels were 
sliced at 2 mm intervals, and the proteins were eluted from each fraction into 
0.5 ml of PBS by shaking for 24 hours at room temperature. The eluates 
were assayed directly for thread proteins by M-IRMA. 
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Corresponding with the widespread immunocytochemical staining of 
PNET1 cells with Th7. ThlO, Th34. and Th29, thread protein 
immunorcactivity was readily measured in these cells by M-ffiMA. In other 
words, with Th7, ThlO, Th34, and Th29 monoclonal antibodies (MoAb) used 

5 as capture antibodies, either singularly or with two of them together, and m -I 

labeled Th9 was used as the tracer, sirailarly high levels of thread protein 
were measured (Figure 2). In contrast, in PNET2, C6, and A172 cells, which 
exhibited intense immunoreactivity with Th9, but little or no 
fanmiiiiocytoGtaeiiiical staining with the Th monoclonal antibodies that were 

10 used to capture antigen, the levels of thread protein detected by M-IRMA 

were much lower than those measured in the PNET1 cells (Figure 2). 
Similarly, Huh7 cells, which manifested no inununocytochemical staining with 
any of the thread protein monoclonal antibodies, had virtually nondelegable 
levels of thread proteins in the cellular lysates by M-IRMA. The 

15 concentrations of thread protein in the cell lysates were computed from a 

standard curve generated with purified FTP using Th7 and ThlO as capture 
antibodies. The results expressed as mean S.D. pg/mg of total protein were 
as follows: PNET1-13.1 ± 0.39; PNET2-2.06 ± 0.10; A172-3.38 ± 0.37; 
C6-2.52 ± 0.22; and Huh7-0.34 ± 0.05. 

20 Example 3 

Characterization of Neural Thread Proteins in Tumor Cell Lines 

In Western Blot anaylsis, samples containing 100 fig of protein were 
fractionated by SDS-PAGE, along with pre-labeled molecular weight 
standards. The proteins were blotted onto nylon membranes (Immobilon-P 
25 transfer membrane, Millipore) using a semi-dry transfer apparatus (Integrated 

Systems). The membranes were washed in Tris buffered saline (TBS; 10 mM 
Tris, 0,85% sodium chloride, P H 7.5), and then blocked with TBS containing 
3% BSA. The blots were incubated overnight at 4°C with m -I labeled Th9 
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monoclonal antibody. Unspccifically bound probe was removed by washing 
the membranes at room temperature in TBS-BSA 3 x 15 min, and 1 x 30 min. 
The results were analyzed by autoradiography using Kodak XAR film. 

To prepare samples for immunoprecipitation studies, one milliliter 
samples of cell lysate containing approximately 1 mg/ral of protein were used 
for immunoprecipitation studies. The lysates were initially pre-cleared with 
non-relevant antibody (SC3 or antidesmin), and then with Protein A 
sepharose. Thread proteins were immunoprecipitated using 5-10 fig of Th9 
and Protein A sepharose (Sasaki et al t J. Biol Chem. 265:1-4 (1993)). The 
immune complexes collected by centrifugation were resuspended in buffer 
containing 2% SDS and 10 mM 0-mercaptoethanol, and then subjected to 
SDS-PAGE under denaturing and reducing conditions (Id.). Crude cellular 
lysates (100 fig protein) were analyzed simultaneously. The proteins were 
blotted onto Immobilon-P membranes and probed with l23 -I labeled (Id.) Th9 
to detect thread proteins and related molecules. Negative control experiments 
were performed simultaneously using either monoclonal antibodies to Hepatitis 
B surface antigen (5C3) or to desmin. 

Metabolic labeling experiments were performed using monolayers of 
cells cultured in 100 mm 2 petri dishes. Prior to labeling, the cells were 
exposed to methionine- and cysteine-free medium for 2 h. The medium was 
then replaced with 3 ml of DMEM containing 300 /iCi each of [ 35 S] 
methionine or [ 35 S] cysteine. After labeling for 3 hours, the cells were 
incubated for various intervals with complete medium devoid radiolabeled 
amino acids and supplemented with 10 mM methionine. Cell lysates were 
prepared as described above. Thread proteins were immunoprecipitated using 
the Th9 monoclonal antibody and protein A sepharose, and the 
immunoprecipitation products were analyzed by SDS-PAGE and film 
autoradiography. 

For the in vivo phosphorylation studies, cells cultured as described for 
metabolic labeling studies were washed twice with TBS and incubated for 2 
h with phosphate-free Dulbecco's MEM containing 10% dialyzed fetal calf 
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serum. Then the cells were washed with TBS and incubated for 3 h with the 
same medium containing 400 fiCi/ml c f [ 31 P] orthophosphoric acid. The cell 
lysates were analyzed by immunoprecipitaUon with thread protein, and both 
positive (p36) and negative (desmin) control monoclonal antibodies, followed 
by SDS-PAGE. 

In order to study the glycosylation state of neural thread proteins, cell 
culture lysates containing approximately 100 M g or protein were subjected to 
SDS-PAGE, and the fractionated proteins were transferred to Immobilon-P 
membranes (MUUpore). 0- and N-glycans were detected by periodate 
oxidation followed by biotinylation, and then Western blot analysis with a 
Streptavidin-alkaline phosphatase probe and NBT/BCIP as the colorimetric 
substrate. The assays were performed using the GlycoTrack Kit (Oxford 
Glycosystems, Rosedale, NY) according to the protocol provided by the 
manufacturer. 

Th9-immunoreactive proteins were detected in lysates of PNET1, 
PNET2, SH-Sy5y, C6, and A172 cells by four different methods: Western 
blot analysis, immunoprecipitation followed by Western blot analysis, 
metabolic labeling followed by immunoprecipitation, and SDS-PAGE 
combined with M-IRMA. Western blot analysis of crude cellular lysates using 
« M I-labeled Th9 demonstrated -21 kDa bands in the above cell lines (as 
indicated by the arrow in Figure 3), but the signal intensity was low. In 
contrast, in lysates of human pancreatic tissue, the expected 17 kDauncleaved 
and 14 kDa cleaved forms of pancreatic thread protein were readily detected 
by Western blot analysis (Figure 3). Thread proteins were not detected in 
lysates of human hepatocellular carcinoma cell lines. The strikingly greater 
abundance of thread proteins in pancreatic tissue compared with neuronal and 
glial cell lines is consistent with a previous finding of lOMbld higher levels 
of thread proteins in pancreas and pancreatic juice compared with brain tissue 
and cerebrospinal fluid (Ozturk et al. . Proc. Natl. Acad. Sci. USA 56:419-423 
(1989); de la Monte et. al.. J. Clin. Invest. 56:1004-1013 (1990); de la Monte 
et. al., J. Neurol. Sci. 7J3:152-164 (1992); de la Monte etal., Ann. Neurol. 
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52:733-742 (1992)), Although one would expect that thread proteins 
synthesized by PNET and glial cells are secreted as is the case for PTP and 
NTP, thread proteins were not detected in the tissue culture medium by 
Western blot analysis, even after concentrating the medium four- or five-fold 
5 by lyophilization. 

Th9-immunoreactive thread proteins were more readily detected in 
PNET and glial cell lines by first immunoprecipitating from the lysates with 
either Th7+Thl0 or Th9, and then performing Western blot analysis using 
l25 Mabeled Th9 (direct) (Figure 3), or unlabeled Th9 with l25 I-labeled Protein 

10 A (indirect). Both methods demonstrated 21 kDa thread protein-related 

species, similar to those detected by Western blot analysis. In addition, - 17 
kDa bands were also observed in both PNET and glial cells, but the signal 
was inconsistent and low-level, as determined by Western blot analysis. As 
negative controls, the Huh7, HepG2, and FOCUS (Lun et aL, In Vitro 

15 (Rockville) 20:493-504 (1984)) human hepatocellular carcinoma cell lines were 

studied simultaneously under identical conditions, and Th9-immunoreactive 
proteins were not detected in the cellular lysates. 

The molecular sizes of thread proteins present in PNET and glial cells 
were most prominently demonstrated by metabolical labeling with 3S S- 

20 methionine or 35 S-cysteine, followed by immunoprecipitation using Th9 

monoclonal antibody. Monoclonal antibodies to desmin or to hepatitis B 
surface antigen (5C3) were used as negative controls for immunoprecipitation. 
In both PNET and glial cell lines, -26 and -21 kDa Th9-immunoreactive 
proteins were detected by SDS-PAGE analysis of the immunoprecipitated 

25 products (Figure 4B). In PNET1 cells, the 21 kDa band appeared as a 

doublet (Figure 4A); the accompanying slightly higher molecular weight 
species appeared to be less abundant than the dominant band at -21 kDa. In 
addition, in both PNET and glial cell lines, there were also -17 kDa Th9- 
immunoreactive proteins associated with bands of nearly the same intensity as 

30 the -21 kDa bands. In C6 cells, there were also -26 kDa, - 14-15 kDa 
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and —8 kDa Th9-immunoreactive proteins which were not detected in PNET 
cells (Figures 4A and 4B, arrows). 

The 21 kDa and 17 kDa thread proteins in SH-Sy5y, PNET1, A172, 
and C6 cells, and their absence in hepatocellular carcinoma cells were also 
5 demonstrated by SDS-PAGE/M-IRMA (Figures 5A-5E). Cellular proteins 

fractionated by SDS-PAGE were eluted from the gels sliced at 2 mm intervals, 
and assayed directly for thread protein immunoreactivity by M-IRMA using 
Th7+Thl0 as capture antibodies, and li5 I-labeled Th9 as the tracer. Despite 
low levels, two distinct peaks were evident in all neuroectodermal cell lines, 

10 but not in Huh7 hepatocellular carcinoma cells assayed simultaneously and in 

the same manner. The resolution of these gels did not permit distinction of 
- 17 kDa from ~ 14-15 kDa proteins which might have been present. 

PNET1 and C6 cells were metabolicaUy labeled with 32 P or 35 S- 
methionine, and thread proteins were immunoprecipitated from the lysates 

15 using Th9 monoclonal antibody (Figure 6). As a negative control, 

immunoprecipitation studies were conducted using an equal portion of the 
cellular lysate and monoclonal antibodies to desmin protein (Figure 6, right 
panel). In the cells labeled with M S methionine, Th9-immunoreactive bands 
were detected at -26 kDa and -21 kDa (upper arrows), ~ 17 kDa (lower 

20 arrows), and also at ~ 14-15 kDa (Figure 6). After 32 P labeling, only the 21 

kDa band was observed by immunoprecipitation with Th9 monoclonal 
antibody; the other molecular weight species did not appear to be 
phosphorylated (Figure 6). Phosphorylated Th9-immunoreactive proteins were 
detected in C6 cells, but not in PNET1 cells, but this might be due to less 

25 efficient labeling since PNET1 cells grow slower than C6 cells. No bands in 

the 14 kDa to 26 kDa range were detected using monoclonal antibodies to 
desmin for immunoprecipitation (Figure 6). Carbohydrate moieties were not 
detected in Th9 immunoprecipitated proteins (data not shown). 

The highest concentrations of thread protein were measured in 

30 subconfluent cultures of PNET1 cells, i.e. during the log phase of growth, and 

the lowest concentrations in overnight serum-starved cultures (growth arrest) 
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(Figure 7). Cultures that were 100% confluent also had lower levels of thread 
protein expression compared with proliferating cultures. Huh7 hepatocellular 
carcinoma cells (negative control) were simultaneously studied using identical 
culture conditions, but the levels of thread protein remained low throughout. 

5 Surprisingly, there was no change in the degree of thread protein 

immunocytochemical staining of PNET cells cultured under these various 
conditions. However, the degree to which the levels of thread proteins 
changed by M-IRMA measurement may not have been detectable by 
immunoevtochemistry. Nevertheless, the reduction in cellular thread protein 

10 content induced by serum starvation was associated with a change in the 

phenotype of the cells. When the cells achieved 100% confluence or after 
they had been subjected to overnight serum starvation, the cell bodies reduced 
in size, and they exhibited striking changes in the degree and distribution of 
immunoreactivity for neurofilament protein, GAP-43, and GFAP (Figure 8). 

15 In PNET cultures that were 50% confluent, the cells exhibited punctate and 

often a polar distribution of neurofilament and GAP-43 immunoreactivity, 
whereas 100% confluent and serum-starved PNET cultures exhibited diffuse 
perikaryal immunoreactivity for both neurofilament and GAP-43. The 
punctate immunoreactivity may have corresponded with distribution of 

20 neurofilament and GAP-43 in neurites. In contrast, 50% confluent PNET 

cultures were devoid of GFAP immunoreactivity, while 100% confluent and 
serum-starved cultures contained conspicuous proportions of GFAP-positive 
cells. Moreover, the proportion of GFAP-immunoreactive cells was greatest 
in 100% confluent serum-starved cultures, followed by 50% confluent serum- 

25 starved cultures, and then 100% confluent cultures with medium containing 

10% fetal calf serum. Therefore, the reduction in thread protein levels 
measured in PNET cells subjected to overnight serum starvation may have 
been due to differentiation of the cells toward an astrocytic phenotype. C6 
cells and other glioblastoma cell lines exhibited intense immunoreactivity with 

30 the Th9 monoclonal antibody, but the levels of thread protein measured by M- 
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IRMA were often low, possibly due to low-level immunoreactivity with other 
thread protein antibodies, including Th7 and TH10 (see Figures 1A-1J). 

Example 4 

Cloning of Thread Proteins from Human cDNA Libraries 

5 Human brain cDNA libraries made from 17-18 week old fetal brain 

(Stratagene, Inc., La Jolia, CA), 2 year-old temporal lobe neocortex 
(Stratagene), and end-stage Alzheimer's disease cerebral cortex (In Vitrogen; 
San Diego, CA) were screened using probes generated from a 416 bp DNA 
fragment corresponding to nucleotides 235-650 of the rat PTP cDNA, The 

10 rat PTP cDNA, designated 018, was isolated from a rat pancreatic cDNA 

library using synthetic 60mer DNA probes corresponding to nucleotides 45- 
104 and 345-404 of the published sequence (Terazono et aL, J. Biol. Chem. 
265:2111-2114 (1988); Watanabe et aL, J. Biol. Chem. 265:7432-7439 
(1990)). Approximately 2 x 10 6 plaques or colonies from each library were 

15 screened with low-stringency hybridization using standard techniques (see 

Sarabrook et aL, supra). Putative clones were plaque/colony purified, and the 
DNA inserts were sequenced by the dideoxynucleotide chain termination 
method using T7 polymerase (USB Sequenase; United States Biochemical 
Corp., Cleveland, OH). The sequences were compared with the Genebank 

20 database, and aligned with the nucleic acid sequences of other thread protein 

cDNAs. 

a. CNS Neural Thread Protein cDNA Isolated from Human Fetal Brain 
Library 

A 1.35 kilobase (kb) l-9a CNS thread protein partial cDNA was 
25 isolated in which only a small segment corresponds to an open reading frame, 

and the remainder, to a 3' untranslated region (Figure 9). The sequence of 
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an additional 150 nucleotides was obtained from 5' anchor PCR amplification 
products. A second round of 5' anchor PCR amplification yielded a further 
upstream 600 bp product (Figure 9A). A portion of the l-9a cDNA sequence 
shares significant homology with die 5' end of the human PTP cDNA and the 
Reg gene (Figure 10)- In addition, the initial 5 ' anchor PCR product has 60% 
homology with the 5 f end of the Reg gene, and 63 % homology with Exon 2 
of the human Reg gene (Figure 10A). Moreover, probes generated from the 
590 bp 5'-end fragment of l-9a cDNA hybridized with human brain and 
pancreas mRNA (Figures 12A-12C). The l-9a sequence is also homologous 
with the AD2-2 and AD3-4 cDNAs in that at one end of their completed 
sequences, the overlaps are substantial (Figure 10B). 

b. CNS Neural Thread Protein cDNA Isolated from a Two-Year Old 
Temporal Cortex Library 

The HB4 clone is a 593 base pair partial cDNA that was isolated from 
a 2-year old temporal cortex libraiy. This cDNA contains an open reading 
frame at its 5' end and terminates at nucleotide 275. There is a 
polyadenylation signal beginning at nucleotide 475, and the sequence ends 
with a poly-A tail (Figure 11 A). The deduced amino acid sequence of the 
partial HB4 clone predicts a protein with a molecular weight of 10.4 kDa, and 
a pi of 12.1. The HB4 cDNA exhibits 50% overall nucleic acid homology 
with the human PTP cDNA (Figure 11D), a segment of the human Reg gene 
(Figure HE). 

c. Isolation of Neural Thread Protein cDNAs from an Alzheimer's 
Disease Library 



15 



20 



25 



Using the 018 rat PTP cDNA probe, four related cDNAs were isolated 
from an AD brain library. These clones were designated: AD 2-2, AD 3-4, 
AD 44 and AD 16c (also called AD 10-7) (Figures 16A-16S). 
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The AD 2-2 cDNA is approximately 1.2 kb and it shares significant 
homology with the l-9a cDNA, AD 16c, rat FTP cDNA, and Exon 1 of the 
human Reg gene (Figure 17). The AD 2-2 probe generates a genomic 
Southern blot pattern similar to that obtained with the AD 3-4 probe. Figure 
16E depicts the complete nucleotide sequence of the AD2-2 cDNA clone that 
was isolated from an AD brain library. Random primer generated probes 
based on this sequence hybridized with human brain and neuronal samples but 
not with glial cell lines of with pancreatic RNA. 

Figures 16F, 161, 16J and 16K depict partial nucleotide sequences of 
the AD3-4 cDNA clones that were isolated from an AD brain library. 
Rnadom primer generated AD3-4 probes yielded two mRNA transcripts, 1.6 
kB and 3.4 kB. These mRNA species are over-expressed in AD brains, with 
an average of two-fold elevation compared with aged matched controls 
(N=8). 

The AD cDNA 1.6 kb clone is identical to another clone isolated 
at the same time (AD 5-3) (Figure 18A). The AD 3-4/AD 5-3 cDNA exhibits 
substantial homology with the l-9a 5' anchor PCR products (Figure 18B), as 
well as with the human Reg gene and the Gen2a-EP genomic clone 
(Figure 18B). Southern blot analysis of human genomic DNA with the AD 
3-4 probe revealed a pattern similar to that obtained with the AD 2-2 probe. 

Figures 16L and 16M depict the partial nucleotide sequence of AD 4-4 
which is a 0.8 kb partial cDNA clone which is identical to another cDNA 
isolated at the same time (AD 3-5). This AD 4-4 clone shares substantial 
sequence homology with AD 2-2 and l-9a cDNAs (Figure 19). Figure 16N 
depicts the complete nucleotide sequence of a partial cDNA clone isolated 
from an AD brain library. This cDNA hybridized with brain and neuronal 
cell line mRNA, yielding a single 1.4 kB transcript. 

Figure 160 depicts the nucleotide sequence of the 0.5 kb partial cDNA 
clone AD 16c (also called AD 10-7) that is 72% homologous with AD 2-2, 
and also aligns with human PIP and the human Reg gene sequences (Figures 
20A and 20B). 
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Figure 16R depicts the complete nucleotide sequence of the AD 10-7 
clone that was isolated from an AD brain library. Hybridization of Northern 
blots using either antiscnse cRNA probes or random primer generated DNA 
probes detected 2.6, 1.9. 1.4 and 0.9 kB mRNA transcripts in neuronal cells. 
Neuronal cell lines expressed only the two largest transcripts, while mature 
adult human brains expressed predominantly the two smallest transcripts, and 
either very low or nondetectable levels of the 2.6 kB and 1.9 kB transcripts. 
Using an AD 10-7 probe, Northern blot analysis of RNA obtained from human 
liver, ovary, fallopian tube, colon, stomach, spleen, rectum, thyroid, 12 week 
placenta and kidney was negative. 

Figure 16S depicts the complete nucleotide sequence of the AD 16c 
cDNA clone that was isolated from an AD brain library. Hybridization of 
Northern blots using random primer generated DNA probes yielded the same 
results as obtained with the AD10-7 cDNA clone. The AD16c clone shares 
a 650 bp segment of near identity with AD10-7. In addition, elevated levels 
of AD16c mRNA were detected in AD brains compared with aged control 
brains by Northern blot analysis. 

Example 5 
Analysis of Brain Thread Protein Gene Expression 

Thread protein mRNA expression was examined in the following 
neuroectodermal tumor derived cell lines: central nervous system primitive 
neuroectodermal tumor cells designated PNET1 and PNET2; HGL-16 and 
HGH7 human glioblastoma cells; A172 human glioma cells; C6 rat glioma 
cells; and SH-Sy5y neuroblastoma cells. In addition, human brain tissue from 
patients with Alzheimer's disease or no neurological disease (aged controls), 
and embryonic and postnatally developing rat brain were assayed for thread 
protein mRNA expression. RNA extracted from human and rat pancreas 
served as positive controls. 
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RNA was extracted in 5 M guanidinium isothiocyanate, and then 
isolated by centrifugation through a cesium chloride step gradient (see 
Sambrook et al, supra). RNA was quantified by measuring the absorbance 
at 260 ran and 280 nm. The thread protein mRNA transcript sizes were 
assessed by northern blot analysis, and the levels of expression were evaluated 
by RNA dot blot hybridization. Northern blot analysis was performed by 
electrophoresing samples containing 15 /ig of total cellular RNA through 1% 
agarose-formaldehyde gels. The RNA was transferred to nylon membrane, 
cross-linked with ultraviolet light, and hybridized with probes generated from 
a 600 bp fragment of the 1-9A cDNA clone. The fragment used for 
hybridization studies contained the regions most homologous with the human 
PTP cDNA. The probes were labeled with l 32 P) a-dCTP by the random 
primer method (Amersham Corporation; Arlington Heights, IL). The blots 
were hybridized overnight at 42°C with 2 x 10 6 dpm /ml of probe in buffer 
containing 50% formamide, 5x SSPE, lOx Denhardfs (lOOx Denhardfs is 2% 
Ficoll, 2% bovine serum albumin, 2% polyvinylpyrollidine), 0.5% SDS 
(sodium dodecyl sulfate), and 100 /tg/ml of sheared denatured salmon sperm 
DNA. The membranes were washed in SSPE containing 0,25% SDS using 
standard methods. Autoradiograms were generated by exposing the 
membranes to Kodak XAR film at -80°C. The membranes were subsequently 
stripped of probe and then rehybridized with a synthetic 30mer corresponding 
to 18s RNA to evaluate sample loading. 

Northern analysis of total cellular RNA using probes made from the 
l-9a cDNA disclosed two dominant transcripts in central nervous system 
(CNS) tumor cell lines: one transcript was 1.6 kb, and the other was 0.9 kb 
(Figure 12A). In addition, in the SH-Sy5y neuroblastoma and PNET1 cell 
lines, a larger 4.2 kb mRNA transcript was also detected. The 4.2 kb 
transcript may represent preprocessed mRNA. The same size transcripts were 
detected in adult (R. Brain) and newborn (NB) rat, but the 0.9 kb transcript 
was more abundant in the adult brain whereas the 1.6 kb transcript was more 
abundant in the newborn rat brain. In rat pancreas (R. Pane), only a 0.9 kB 
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transcript was detected, corresponding to the size of rat FTP mRNA 
(Terazono et al.. J. Biol. Chem. 265:2111-2114 (1988); Watanabe et al., J. 
Biol. Chem. 26*5:7432-7439 (1990)). mRNA transcripts were not detected in 
normal liver (Nl Liver). Using a probe generated from the 3' region of the 
l-9a cDNA, the 1.6 kb, but not the 0.9 kb transcript was revealed (Figure 
12B). Using a 30-mer probe corresponding to the most 5'-end of the l-9a 
cDNA, the higher molecular weight mRNA transcripts were detected (Figure 
12C). The 0.9 kb transcript was also evident with longer exposure of the 
blot. 

Northern analysis of human brain RNA disclosed a dominant 1.6 kb 
transcript, but also two and sometimes three smaller transcripts of 1.2 kb, 0.9 
kb, and 0.8 kb (Figure 13B). In contrast to the findings in cell lines, the 4.2 
kb mRNA transcript was seldom observed in adult human brain. 
Hybridization with human pancreas disclosed a 0.8 kb transcript, 
corresponding with the size of PTP mRNA. The transcripts detected in 
human brain and pancreas using l-9a probes were identical in size to the 
transcripts observed using PTP cDNA probes. 

Dot blot RNA hybridization to 5 fig of total RNA using the 600 bp 
fragment of the l-9a cDNA (NTP) demonstrated higher levels of expression 
in AD. compared with aged control brains (Figure 13A). Rehybridization of 
the same membrane with a cDNA corresponding to 0-actin demonstrated 
similar loading of RNA in each dot. The observation of elevated levels of 1- 
9a-related mRNA in AD brain tissue is similar to that reported previously 
using 60mer probes corresponding to human PTP cDNA (de la Monte et. al., 
J. CUn. Invest. 56:1004-1013 (1990)). The differences between AD and 
control brains appeared to be due to differences in the levels of the 1.6 kb, 
0.9 kb and 0.8 kb transcripts, as shown in Figures 13A and 13B. 

The AD-NTP 3-4 cDNA, isolated from the AD library, hybridizes with 
RNA from neuronal-derived neuroectodermal tumor cell lines, and human 
brain tissue. In the cell lines, 1.6 kb and 0.9 kb transcripts as observed with 
the l-9a probe were detected (Figure 21C). However, in human brain, ~4 



kb, 1.6 kbi and 0.9 kb transcripts were detected, and the levels of expression 
for all three transcripts were higher in AD compared with aged control brains 
(Figure 21D). 

AD 4-4 cDNA probe hybridized only with a 0.9 kb transcript, and 
only in neuronal cell lines. 

Example 6 

Direct Cloning and Sequencing of Thread Protein cDNAsfrom 
Neuroectodermal Tumor Ceil Lines and Ahheimer's Disease Brain 

Thread protein cDNAs were cloned directly from PNET1, PNET2, 
SH-Sy5y f and A172 cells, and from Alzheimer's disease and aged control 
brain RNA using the 3'- and S'-RACE methods (Frohman et al, Proc. Natl 
Acad. ScL USA 55:8998 (1988); Ohara et al, Proc. Natl Acad. Scl USA 
86:5673 (1989); Loh et al, Science 243:217 (1989)). Briefly, RNA was 
reverse transcribed using oligo-dT primers. For the 5 '-RACE reaction, the 
cDNAs were amplified by polymerase chain reaction (PCR) using a specific 
17-mer corresponding to a 5'-region of the l-9a sequence, and a 17 dT 
primer. The resulting PCR products were subjected to another round of 
amplification using another internal but overlapping 5 '-end primer, and a 
specific 3 '-17-mer corresponding to a 3' region of the l-9a sequence. For the 
3 '-RACE reactions, the cDNAs were first tailed with dCTP using terminal 
deoxynucleotide transferase, and then they were amplified using a specific 
17-mer corresponding to nucleotides 781-797 of the 1 -9a clone and dG 
(17mer). A second nested PCR amplification was performed using a specific 
17mer corresponding to nucleotides 766-792 at the 3' end, and dGTP (17mer) 
for the 5' end. The PCR products were subjected to Southern blot analysis 
using probes generated from an internal DNA fragment of the l-9a cDNA 
clone, and from the 018 rat PTP cDNA clone. The PCR products were gel 
purified and ligated into pAmpI vectors using uracil deoxytransferase. The 
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subcloned DNA inserts were sequenced by the dideoxynucleotide chain 
termination method using T7 DNA polymerase. 

CNS thread protein transcripts were detected in neuroectodermal tumor 
cell lines and in AD human brain tissue by reverse transcription followed by 
PCR using specific primers corresponding to the 5' and 3' regions of the l-9a 
cDNA sequence. Southern blot analysis of the PCR products demonstrated 
two dominant cross-hybridizing species, 0.8 kb and 1.0 kb (Figures 14A and 
14B). In addition, in the SH-Sy5y cells, a larger 1.8 kb PCR product was 
also detected. In the PNET1, PNET2, SH-Sy5y, and A172 cells, a 0.4 kb 
PCR product that hybridized with the l-9a probe was observed. 
Corresponding with the higher levels of thread protein mRNAs in Alzheimer's 
disease brains, the hybridization signal was more intense in AD samples 
compared with aged control samples. 

The PCR products generated from the SH-Sy5y cells were subcloned 
and sequenced. Southern analysis of the cloned fragments exhibited intense 
hybridization with the l-9a cDNA, and less intense but definite hybridization 
with the 018 cDNA (rat PTP) (Figure 14C). The nucleic acid sequence of 
the SH-Sy5y PCR clone (Sy-NTP) was identical to the l-9a cDNA sequence. 

Example 7 

Isolation of Genomic Clones Coding for Human Brain Thread Proteins 

A human genomic DNA library was screened using probes made with 
a 600 bp fragment of the l-9a human brain thread protein cDNA that was 
isolated from the two year-old temporal cortex library. The l-9a cDNA 
fragment contained a region with 60% nucleic acid sequence homology with 
human PTP. After colony purification, the putative genomic clones were 
checked for cross-hybridization with the 018 rat PTP cDNA fragment. 
EcoRl, Pstl, and EcoRI/Pstl restriction fragments that hybridized with both 
the l-9a and 018 probes were subcloned into pBIuescript II vectors (Promega, 
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Inc., Madison, WI) and then sequenced by the dideoxynucleotide chain 
tennination method using either T7 polymerase (USB Sequenase) or 
polymerase chain reaction amplification and Vent polymerase. 

Four genomic fragments designated G2-2PWI, G2-2Pstl-Ec6Bl, G5d-1 

5 Pstl, and G5d-1 Pstl-EcoW. were isolated from a human genomic DNA library 

(Figures 22A-22D). These genomic fragments all hybridized with both the 
l-9a and 018 cDNA probes, and they ranged in size between 1.5 kb and 3 
kb. Partial nucleic acid sequence information demonstrated homology 
between G2-2PJ/! and the human Reg gene and human and rat FTP cDNAs 

10 (Figure 23A); between G2-2 Pstl-EcoVl and both the Reg gene and rat FTP 

cDNA (Figure 23B), and also with AD 2-2, AD 3-4, and the l-9a cDNAs 
(data not shown); between G5d-1 Psti. and the Reg gene and human PTP 
(Figure 23C); and between G5d-1 Pstl-EcdKL and Reg gene, human PTP, 1- 
9a, and AD 4-4. 

15 Example 8 

In vitro Expression of the LacZ Fusion Protein and Demonstration of Us 
Relatedness to Thread Proteins 

Fusion protein expression in bacteria containing the l-9a cDNA clone, 
or one of the four genomic clones was induced with isopropylthio-fi-D- 

20 galactoside OPTO) using standard techniques (Sambrook et al. , supra). Crude 

bacterial lysates from induced and uninduced .cultures were subjected to SDS- 
PAGE and Western blot analysis using the Th9 monoclonal antibody to thread 
protein (Sasaki etal, J. Biol Chem. 268:1-4 (1993)). and m -l labeled protein 
A to detect the bound antibody. In addition, bacterial lawns containing cloned 

25 DNA were induced to express the fusion protein with IPTG, and replica filters 

were probed directly with Th9 monoclonal antibody followed by m -I labeled 
protein A. 
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Thiead protein immunoieactivity was demonstrated in the bacterial 
fusion proteins by direct antibody binding to the IPTG-induced colonics 
(Figures 24A-24D). Thread protein immunoreactivity was detected using a 
cocktail of Th9, Th7, and TblO monoclonal antibodies to FTP (Sasaki et al, 
5 J. Biol Chem. 268:1-4 (1993). and l2S -I labeled Protein A. 

Example 9 

Relative Levels of AD J 6c mRNA in AD and Aged Control Brains 

Northern blot analysis was performed using an AD16 cDNA probe. 
The blots were re-probed to detect 18s ribosomal RNA to evaluate loading of 

10 RNA in each lane. The unsaturated autoradiograms were subjected to 

densitometric analysis using a Molecular Dynamics Image Analyzer. The 
ratios of the AD16c and 18s RNA hybridization signals were plotted for each 
case, and the results are depicted graphically in Figures 25A and 25B. The 
mean ratios (relative levels of AD16c) with standard errors arc depicted in the 

15 smaller right hand graph. The findings confirm that there are elevated levels 

of AD16c mRNA expression in 6 of 9 AD brains compared to 1 of 6 age- 
matched controls. The difference between the mean levels is highly 
statistically significant (P < 0.005). Similar results were obtained using AD10- 
7 probes. Theses results demonstrate that there is a statistically significant 

20 increase in levels of expression in AD brains compared to control brains. 

Example 10 

Preparation of Recombinant AD10-7 Fusion Protein and Detection 
Thereof With Monoclonal Antibodies 

AD10-7 cDNA was ligated into pTrcfflS vectors (In Vitrogen, San 
25 Diego) in three different reading frames (two incorrect-A and B, and one 

correct-C). Bacteria transformed with one of the three plasmids were induced 
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with IPTG and bacterial lysates were examined for protein expression 0, 1 and 
5 hours later. The proteins were fractionated by SDS-PAGE, and Western 
blot analysis was performed using monoclonal antibodies against the expressed 
tag protein (T7-tag mouse monoclonal antibodies; Novogcn). The blots were 

5 developed using the avidin-biotin, horseradish peroxidase method, with 

iUaminobenzidine as the chromogen (Figure 26). A band corresponding to 
-45 kDA was detected in bacteria that had been transformed with plasmid 
DNA which contained AD10-7 ligated only in the correct reading frame (C) 
(arrow). The same size protein was observed by in vitro translation of the 

10 AD10-7 cDNA in a rabbit reticulocyte lysate assay system. In both systems, 

the fusion partner peptide was ~3 kDA, indicating that the cDNA encodes a 
protein of about -42 kDA. A -42 kDA NPT species is routinely detected 
by Western Blot analysis of neuronal cell lines and of human brain tissue. 

Example 11 

15 Demonstration of Neuronal Localization of AD10-7 mRNA Expression by 

In Situ Hybridization 

Sense and antisense cRNA probes were generated from linearized 
AD10-7 plasmid DNA using SP6 or T7 DNA-dependent RNA polymerase, 
respectively. The antisense probes hybridized with neuronal cell line mRNA 

20 as described above for this clone. The cRNA sense probes, on the other 

hand, failed to hybridize with RNA by Northern blot analysis. cRNA probes 
labeled with digoxigenin-UTP were hybridized with human brain tissue 
sections from early AD. After washing the sections extensively (de la Monte 
et at., J. Clin. Invest. 56:1004-1013 (1990)), the hybridized probes were 

25 detected using peroxidase or alkaline phosphatase conjugated monoclonal 

antibodies to digoxigenin, and the colorimetric reactions were revealed using 
standard methods. Examination of the sections by brightfield and darkfield 
microscopy demonstrated hybridization of AD10-7 only in neurons (Fig. 27; 
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dense aggregates of white grains over cell bodies in (Fig. 27A)). In contrast, 
and similar to the findings by Northern blot analysis, the sense AD10-7 cRNA 
probes failed to hybridize with brain tissue (Fig. 27B). 

Although the foregoing refers to particular preferred embodiments, it 
will be understood that the present invention is not so limited. It will occur 
to those of ordinary skill in the art that various modifications may be made to 
the disclosed embodiments and that such modifications are intended to be 
within the scope of the present invention, which is defined by the following 
Claims. 

Example 12 

Levels ofNTP Expression in AD and Other Neurodegenerative Diseases 

a. Cellular localization and accumulation of NTP immunoreactivity in 
AD brains demonstrated by immunohistochemistry 

NTP immunoreactivity was detected in histological sections with the 
high affinity Th9 monoclonal antibody to PTP, the pancreatic form of thread 
protein. Although the PTP MoAbs cross-react with NTP, NTP is distinct 
since it differs in molecular mass, and many antigenic epitopes present in PTP 
are not shared with NTP. Moreover, hybridization studies demonstrated NTP 
mRNA transcripts in neurons, indicating that NTP is synthesized in brain. 
NTP immunoreactivity is localized in neuronal perikarya, and in neuropil and 
white matter fibers. In AD, the density of NTP immunoreactive neurons, and 
intensity of immunoreactive staining are increased relative to intact aged 
control brains. Increased neuronal labeling in AD was detected in the cerebral 
cortex and subcortical nuclei. Within the neocortex, NTP immunoreactivity 
was primarily distributed in Layers III, V, and VI. Although NTP 
immunoreactivity was detected in neurons with neurofibrillary tangles or 
granuole vacuolar degeneration, numerous neurons without overt 
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neurodegenerativc changes also expressed high levels of NTP. NTP 
immunoreactivity was not distributed in plaques or extracellular neurofibrillary 
tangles, and dystrophic neurites were not selectively labeled (de la Monte & 
Wands, J. Neurol. ScL 1/5:152-164 (1992); Ozturk, M. etal. Proc. Natl. 
5 Acad. ScL 32:5627-5631 (1985)). 

b. Semiquantitative estimates of NTP expression in AD and other 
neurodegenerative diseases 

A detailed quantitative immunohistochemical analysis of NTP 
expression was performed using matched paraffin-embedded blocks of 

10 different brain regions from patients with AD (N =25), AD plus Parkinson's 

disease (PD) (N=8), PD, AD plus Down's syndrome (AD+DS) (N=6), or 
no neurological disease (aged controls) (N=21). In addition, brains with 
Huntington's disease (N=5) and multi-infarct demenUa (N=2) were studied 
as disease controls. The highest mean densities of NTP immunoreactive 

15 neurons were observed in AD and AD+DS, followed by AD+PD, then PD. 

The AD+PD brains had less severe AD lesions compared with AD, and PD 
had relatively few AD lesions. In contrast to neurofibrillary tangles and 
plaques which showed striking regional variation, the densities of NTP 
immunoreactive neurons were relatively uniform in different neocortical 

20 regions. Elevated levels of NTP immunoreactive expression were detected in 

PD dementia, but the levels were much lower than in AD or AD+PD (de la 
Monte & Wands, J. Neurol. ScL JJ3:152-164 (1992)). The finding of high 
densities of neurofilament immunoreactive dystrophic dendrites in PD 
dementia, similar to AD, suggested that AD histopathologic^ lesions may 

25 have been evolving at the time of death, and thus account for the modestly 

elevated levels of NTP in these cases. Specificity of the elevated NTP gene 
expression in AD was corroborated by the absence of increased NTP 
immunoreactivity in brains with Huntington's disease or multi-infarct 
dementia. However, in the vicinity of subacute cerebral infarction in both 
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control and diseased brains, NTP immunoreactivity was strikingly increased 
in viable appearing neurons, while in the setting of healed infarction, NTP 
immunoreactive expression was not elevated. Thus, NTP gene expression can 
be modulated by neuronal injury with attendant reparative or regenerative 
sprouting. 

c. Demonstration of elevated NTP levels in AD brain tissue by a 
quantitative radioimmunoassay 

A highly sensitive three-site monoclonal antibody-based 
immunoradiometric assay (M-IRMA) was developed to measure NTP 
concentrations in biological fluids and tissue homogenates. The M-IRMA was 
developed using the Th7, Th9, and ThlO MoAbs to PTP, which are 
cross-reactive with NTP in tissue sections and brain homogenates. Briefly, 
the Th7 and ThlO antibodies, bound to polystyrene beads, were used to 
capture NTP in brain homogenates or biological fluids. The captured antigens 
were detected using 115 I-labeled Th9 high affinity MoAb. The concentrations 
of NTP present in the samples were computed from a standard curve 
generated with different amounts of purified PTP. This highly sensitive assay 
detected as little as 10 pmol of thread proteins. 

Homogenates of fresh frozen brain tissue sampled immediately adjacent 
to the blocks taken for histological and immunohistochemical staining, were 
used to measure NTP concentrations. The tissue was homogenized in 
phosphate buffered saline (0.85% NaCl, 10 mM phosphate, pH 7,4) plus 
protease inhibitors. The supernatant fractions obtained after centrifiigation at 
12,000 x g was used to measure NTP concentration by M-IRMA. Note that 
this gentle extraction procedure excluded membrane bound and insoluble NTP 
which may have been present in the tissue. However, the Th MoAbs bind to 
conformational rather than linear epitopes, and immunoreactivity was lost to 
unpredictable degrees in specimens that had been extracted with denaturing or 
reducing reagents. Using M-IRMA, significantly high levels of NTP were 
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detected in AD cerebral tissue compared with corresponding regions of intact 
aged control brains. In addition, the concentrations of NTP in AD+DS and 
AD+PD were significantly elevated relative to control and PD. In AD, 
elevated levels of NTP were detected in all regions of cerebral cortex and in 
5 subcortical nuclei, and the degree of increased expression was correlated more 

with the patterns of NTP immunohistochemical staining in neurons than with 
the distribution of neurofibrillary tangles and plaques (de la Monte & Wands, 
/. Neurol ScL 113: 152-164 (1992)). 

d. Detection of NTP in cerebrospinal fluid 

10 NTP immunoreactivity was also detected in choroid plexus and 

ependymal epithelial cells, although corresponding mRNA expression has not 
been observed. NTP was assayed directly in cerebrospinal fluid (CSF) 
samples using M-IRMA. To determine the molecular mass of NTP present 
in CSF and demonstrate its distinctiveness from PTP, which is present in high 

15 concentrations in blood, CSF samples containing 100 ^g of protein were 

fractionated by SDS-PAGE, and proteins eluted from the gel fractions were 
assayed for NTP by M-IRMA as described above. Unlike brain tissue in 
which several different size NTP-related molecules may be expressed, the only 
species of NTP detected in clear CSF samples had an Mr of -21 kD. In 

20 contrast, hemorrhagic samples contained a dominant 21 kD peak, and another 

14 kD peak, probably corresponding to PTP. In several samples of AD CSF 
or ventricular fluid, 21 kD NTP molecules could also be detected by 
immunoprecipitation followed by Western blot analysis using monoclonal or 
polyclonal Th antibodies. 

25 e. Demonstration of elevated levels of NTP in AD CSF 

NTP concentrations were measured in paired postmortem samples of 
clear ventricular fluid (VF) and temporal lobe neocortex using M-IRMA. The 
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concentrations of NT? in postmortem ventricular fluid from patients with 
histopathologically proven AD were significantly elevated compared with the 
levels in similar specimens from aged control patients. In addition, the 
concentrations of NT? measured in ventricular fluid were positively correlated 
with the levels of NT? in cerebral tissue, such that the mean values were 
nearly identical. Intact aged control and PD brain and VF samples contained 
low levels of NT?. As observed in histological sections, in the setting of 
subacute cerebral infarction or non-specific injury, during an interval when 
regenerative neuronal sprouting would be expected, the levels of NT? 
measured in both cerebral tissue and VF were also elevated. However, in 
specimens from patients with remote cerebral infarcts and multi-infarct 
dementia, no elevation of NT? was detected in either brain tissue homogenates 
or VF samples (de la Monte, S.M. et al. 9 Ann. Neurol 52:733-742 (1992)). 

/. Detection of elevated levels of NTP early in the course of AD 
dementia 

A large clinically-based study was conducted to determine whether the 
concentrations of NTP in CSF of patients with early manifestations of AD 
were significantly elevated compared with neurological disease (PD, Multiple 
sclerosis-MS) and non-demented control patients (back pain, cervical 
spondylosis, depression, headache, psychosis). NTP concentrations were 
measured by M-IRMA, and the assays and data analysis were conducted under 
code. In patients with clinically diagnosed AD, with confirmed follow-up 6 
to 10 years later, the concentrations of NTP were significantly elevated 
compared with both the non-demented and neurological disease control 
patients (Table 1) (de la Monte, S.M. etal, Ann. Neurol. 32:733-742 
(1992)). Comparison of antemortem early AD CSF levels with postmortem 
temporal neocortex and ventricular fluid end-stage AD levels demonstrated 
striking increases in mean NTP concentration with progression of disease, as 
opposed to no significant change over the same intervals in aged control 
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samples. Moreover, in paired samples from 9 patients with AD, the 
concentrations of NTP in postmortem brain and CSF were 5- to 50-fold higher 
rtan in corresponding antemortem CSF samples obtained approximately 6 
years earlier. These findings demonstrated that NTP levels were significantly 
5 elevated in CSF of individuals with AD, and that NTP levels in CSF may 

increase strikingly with progression of dementia and neuronal degeneration. 
Thus, elevated concentrations of NTP in CSF can serve as an in vivo marker 
for AD neuronal degeneration. 



Table 1: Elevated NTP Levels in Antemortem CSF in Early AD 
10 Dementia 





AD 


PD 


MS 


Control 


P-value 


No. of Patients 


84 


45 


73 


73 




Age (years) 


76 


61 


41 


55 


< 0.01 


Blessed Score 


15 


5 


ND 


ND 


< 0.001 


CSF NTP (ng/ml) 


4.2 


1.9 


1.6 


1.3 


< 0.001 



15 Example 13 

Cloning of the human brain cDNA encoding NTP 

a. Strategy for Isolating NTP cDNAs 

Using probes prepared with a rat PTP cDNA, a single 1.4 kB mRNA 
transcript was detected in AD and DS brains (de la Monte, S.M. et aL f J. 
20 Clin. Invest. 56:1004-1013 (1990)). However, low stringency hybridization 

with either rat or bovine PTP cDNA probes revealed 4 distinct 
cross-hybridizing NTP transcripts in human brain. To isolate NTP cDNAs, 
several human brain cDNA libraries were screened using probes derived from 
the 3' half of the rat PTP cDNA. Clones were selected for further study 
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based upon positive Southern blot analysis with probes derived from the 5' 
half of rat FTP. In addition to nucleic acid sequence analysis, final clone 
selection was based upon detection of the appropriate size mRNA transcripts 
in pancreas and human brain. An incomplete probable NTP cDNA initially 
isolated from an AD brain library, was then used to re-screen the AD brain, 
as well as a 17 week human fetal brain library to obtain full-length and other 
related cDNA clones. 

b. Characteristics of the AD7c-NTP cDNA isolated from an AD brain 
library 

The AD7C-NTP clone is a 1.39 kB complete cDNA that encodes a 
protein with a predicted Mr of 39 kDa (SEQ ID NO:120 and 121, nucleotide 
and amino acid sequence, respectively). The cDNA contains an AUG start 
codon, 1140 bp of continuous open reading frame, and a 250 bp 3' 
untranslated segment, followed by an AATAAA poly adenylalion signal. One 
unusual feature of the AD7c-NTP cDNA is that it contains a tandemly 
repeated head-to-tail dimer of a 570 bp sequence within the coding region. 
The deduced amino acid sequence of the 570 bp sequence is 45% homologous 
with human PTP. Importantly, there is conservation of the positions of 5 of 
the 7 Cys residues, a feature that appears to be characteristic of thread 
proteins (Lasserre, C. etal., Cancer Res. 52:5089-5095 (1992)). The 
AD7c-NTP protein contains a hydrophobic leader sequence with a potential 
cleavage at amino acid residue #15, and multiple Ser and Thr phosphorylation 
motifs. Correspondingly, several NTP molecules expressed in primitive 
neuroectodermal tumor cell lines (PNET1 and PNET2), and in SH-Sy5y 
neuroblastoma cells are phosphorylated by insulin stimulation or by activation 
protein kinase C (see below). The translated AD7c-NTP protein also has 
numerous hydrophilic domains. 



WO 96/15272 



PCTVUS95/17111 



85 



c. Tissue distribution ofAD7c-NTP mRNA by Northern blot analysis 

Northern blot analysis was performed using 15 fig samples of total 
RNA extracted from adult human brain, kidney, liver, spleen, gastrointestinal 
tract (various regions) ovaries, fallopian tubes, uterus, thyroid, lung, skeletal 

5 muscle, and pancreas, and from adult rat brain, kidney, liver, spleen, 

gastrointestinal tract (various regions), testis, thymus, lung, skeletal muscle, 
and pancreas. Random primer generated [32P]dCTP-labeled DNA probes, 
prepared with the AD7c-NTP cloned insert as the template, hybridized under 
highly stringent conditions with RNA from human and rat brain and pancreas. 

10 Cross-hybridization signals were not detected in the other organs and tissues. 

In the pancreas, the AD7c-NTP probes hybridized with 0.9 kB transcripts, 
corresponding with the size of FTP. In adult human brain, the AD7c-NTP 
hybridized with 1.4 kB and 0.9 kB mRNA transcripts. In adult rat brain, the 
AD7c-NTP probes hybridized with 0.8 kB transcripts. 

15 rf# Expression of mature and fetal brain forms of AD7c-NTP-related 

mRNA transcripts 

Northern blot analysis demonstrated 5 distinct AD7c-NTP-related 
mRNA transcripts. Two of the mRNA transcripts (3.2 kB and 1.9 kB) were 
mainly expressed in fetal brain and neoplastic neuronal cells, e.g. primitive 

20 neuroectodermal tumors cell lines, while the other three (1.4 kB, 1.2 kB, and 

0.8 kB) were primarily expressed in postnatal developing and mature brains. 
With increasing age, there was a progressive decline in the steady-state levels 
of all AD7c-NTP-related mRNA transcripts, and a shift toward exclusive, 
very low-level expression of the 0.8 kB transcript in the adult rat brain. The 

25 major decline in postnatal NTP gene expression was between days 1 and 8, 

coincident with the reduction in development-associated cortical neuritic 
• sprouting. 
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e. Demonstration ofAD7c-NTP mRNA up-regulated expression in AD 
brains 

AD7c-NTP mRNA expression was examined in AD and aged control 
brains by Northern blot analysis. RNA was extracted from matched samples 
of frontal lobe neocortex (Brodmann Area ll) t and 15 fig of total RNA were 
fractionated. Using random primer generated DNA probes, two 
AD7c-NTP-related mRNA transcripts, 1.4 kB and 0.9 kB were detected in 
both AD and control brains. Quantitative assessment of the levels of 
expression was made by volume densitometric analysis of unsaturated 
autoradiograms. After correcting for differences in sample loading, based 
upon corresponding 18s ribosomal RNA hybridization signals (obtained by 
re-probing the blots with a 30mer corresponding to 18s RNA), it was 
determined that the steady state levels of both the 0.9 kB and 1.4 kB 
AD7C-NTP mRNA transcripts were elevated in most of the AD brain 
samples. In addition, the mean level of AD7C-NTP mRNA in AD brains was 
two-fold higher than in aged control brains (P<0.01). 

f. Cellular localization of AD7c-NTP mRNA expression by in situ 
hybridization 

In situ hybridization was used to demonstrate cellular localization of 
AD7c-NTP-related mRNA transcripts. Antisense and sense cRNA probes 
were prepared from linearized AD7c-NTP cDNA template, and purified from 
polyacrylamide gel. cRNA probes labeled with [ M P]-UTP were used in 
Northern blot analysis to demonstrate specificity of hybridization with 
antisense, and absence of hybridization with sense probes. In situ cRNA 
probes were labeled with digoxigenin-UTP. Hybridized probes were detected 
with alkaline phosphatase- or horseradish peroxidase-conjugated 
anti-digoxigenin antibodies, and BCIP/NBT substrate. AD7c-NTP gene 
expression was detected in cortical neurons of both AD and control frontal 
(Brodmann Area 1 1) and temporal (Area 21) neocortex using antisense cRNA 
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probes. Hybridization signals were not detected in white matter or glial cells, 
nor in tissue hybridized with sense cRNA probes (negative control). 

g. In vitro translation and expression of the AD7c-NTP clone 

Sense and antisense RNA transcripts were incorporated into rabbit 
reticulocyte lysate in vitro translation assays, and the products analyzed by 
SDS-PAGE. A single 39 kD protein was generated by translation of sense 
strand cRNAs. SDS-PAGE analysis of AD7c-NTP recombinant fusion 
proteins generated in a pTrcHis expression vector (InVitrogen) also 
demonstrated the translated product to be -39 kD. Western blot analysis of 
AD7c-NTP recombinant proteins demonstrated positive immunoreactivity with 
polyclonal antibodies to FTP, under non-reducing conditions. 

h. Polyclonal antibodies to AD7c-NTP are immunoreactive with PTP, 
and polyclonal anti-PTP is immunoreactive with recombinant 
AD7c-NTP protein 

Rabbit polyclonal antibodies were generated to the AD7c-NTP-pTrcHis 
recombinant protein. The immunoglobulin fraction was precipitated with 
ammonium sulfate and dialyzed against PBS. In a radioimmunoassay, the 
polyclonal antibodies were specifically immunoreactive with the recombinant 
AD7C-NTP at greater than a 1:100.000 dilution of serum. Western blot 
analysis was performed under non-reducing conditions since the antibodies 
may recognize both conformational and linear epitopes. Both anti-AD7c-NTP 
and anti-PTP exhibited positive immunoreactivity with recombinant 
AD7c-NTP protein and purified PTP. However, the intensity of 
cross-reactivity was comparatively low-level for each antibody. 
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L Tissue and cellular distribution of anti-AD7c-NTP immunoreaetivity 

Western blot analysis demonstrated binding of anti-AD7c-NTP with 
pancreas and brain. The protein recognized in the rat pancreas was 17 kD, 
the same as detected with anti-PTP. Several low intensity AD7c-NTP- 
immunoreactive bands were detected in adult rat brain, but the dominant 
species was -39 kD. All other rat organs were negative. The same 
distribution of immunoreaetivity was observed with polyclonal anti-PTP, but 
with brain, the binding intensity was low-level, and most of the bands detected 
with the AD7c-NTP antibodies were not observed with the FTP antibodies. 
Immunocytochemical staining demonstrated positive immunoreaetivity in 
PNET cells of neuronal phenotype, and in neurons, neuropil fibers, and axons 
of mature human brain. Glial cells were not immunoreactive with anti-AD7c- 
NTP. 

/ AD7c-NTP immunoreaetivity in AD brain 

Studies using postmortem brain tissue demonstrated more abundant and 
greater intensities of cortical neuron labeling in AD (N=5) compared with 
intact aged control (N=5) brains. Studies were conducted to examine the 
molecular sizes of AD7c-NTP-related proteins expressed in AD and aged 
control brains by either direct Western blot analysis, and by 
immunoprecipitation followed by Western blot analysis with the same 
antibodies. 21 kD, 26 kD, and 39 kD AD7c-NTP-related molecules were 
found in AD (N=6), control (N=7), and infant Down' syndrome (N=l) 
brains, but higher levels of the 21 kD NTP protein were found in AD relative 
to control. Although the same size bands were detected with polyclonal 
anti-PTP, the sensitivity was low, and the relative intensities of the bands 
were different. For example, the 21 kD and 39 kD NTP molecules were 
more clearly detected with the AD7c-NTP antibodies. In addition, these 
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studies demonstrated abnormal size AD7c-NTP-related bands in several AD 
brains. 

k. Characteristics of the large library of MoAbs generated to 
recombinant AD7c-NTP protein 

5 150 mouse MoAbs were generated to the AD7c-NTP recombinant 

protein. The hybridoma supematants were screened by western blot analysis 
and immunohistochemistry using AD and control brain, and radioimmunoassay 
of recombinant AD7c-NTP. With another radioimmunoassay, the MoAbs 
were also screened against FTP. The objectives of these studies were to do 

10 the following: 1) select high affinity antibodies that recognize 

AD7c-NTP-related proteins (NTP), but not FTP; 2) identify antibodies that 
recognize or bind to neurons in AD brains to a greater extent than in control 
brains; and 3) determine which antibodies bind to molecules in brain tissue or 
CSF that are the same size as recombinant AD7c-NTP. These reagents enable 

15 specific detection of elevated levels of NTP in brain tissue, and also in CSF. 

Analysis of 25 representative MoAbs demonstrated several with 
cross-reactivity between PTP and AD7c-NTP, but most with strong binding 
only to AD7C-NTP. Western blot analysis confirmed high level binding of all 
25 MoAbs with recombinant AD7c-NTP. In addition, 6 antibodies were 

20 identified that recognized precisely the same size molecules in brain as 

detected in the fusion protein, 6 others that recognized slightly high molecular 
weight molecules in brain, and 10 with low-level or absent binding in brain. 
The three remaining antibodies recognized completely different size bands in 
brain compared with AD7c-NTP itself. Immunohistochemical staining studies 

25 demonstrated 5 MoAbs with similar high-level binding in AD and control 

brains. 3 with more intense and widely distributed immunoreactivity in AD 
brains, 6 with low-level binding in both AD and control brains, and 11 with 
little or no binding to histological sections of brain. The degree of binding by 
immunohistochemistry correlated with the findings by Western blot analysis. 
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L A radioimmunoassay to measure levels ofAD7c-NTP-related proteins 
in brain tissue and biological fluids (CSF, serum, urine) 

A M-IRMA has been developed to specifically measure AD7c-NTP 
concentrations in brain, CSF, and blood. Empirical studies have been 
5 performed with a large matrix of MoAbs linked to a solid phase support to 

determine which antibodies were suitable for capture of AD7c-NTP antigen 
in biological fluids. MoAbs were then selected for their high binding 
capability to recombinant AD7c-NTP bound to a solid phase support after 
labeling with l25 I. MoAbs #2 and #5, used in the M-IRMA, were selected 

10 from a panel of 25 MoAbs because of the following characteristics: 1) the 

antibodies were highly reactive to recombinant AD7c-NTP and not PIP when 
bound to a solid phase support; 2) the MoAbs specifically stained neurons in 
AD brains; 3) the MoAbs reacted with a 42 kD species in cell lysates of AD 
brain by Western blot analysis; and 4) the MoAbs were of the IgGl isotype, 

15 and therefore suitable for labeling with 123 I. Finally, competitive inhibition 

experiments were performed to demonstrate that the MoAbs recognized 
separate and distinct antigenic determinants on AD7c-NTP molecules. 

m. Isolation of AD7c-related cDNAsfrom an AD brain library 

In addition to the AD7c-NTP clone, five related but distinct cDNAs 
(AD12-1, AD16b t AD19-1, ADHD, AD16c) were isolated from the AD 
brain library. All 5 cDNAs share either an identical or nearly identical 570 
bp sequence with the AD7c-NTP clone. Each of the cDNAs has been 
subcloned into pTrc-His expression vectors for analysis of the corresponding 
fusion proteins by SDS-PAGE, Western blot, and M-IRMA. Each of these 
fusion proteins was immunoreactive with polyclonal AD7c-NTP antibodies. 
Unique DNA and antibody reagents are made to distinguish expression of the 
corresponding mRNAs and proteins in neuronal cells and brain tissue. The 



20 



25 
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same reagents arc used to analyze function and evaluate expression of distinct 
NTF genes in normal and pathological states. 

n. Isolation of AD7c-NTP-related cDNAs from a human fetal brain 
library 

Five AD7c-NTP-related cDNA clones (FBl-3c, FB1-6C1, FB2-3C2, 
FB2-6C1, FB8-3B2) were isolated from a 17 week human fetal brain library. 
These cDNAs have been partially characterized, and like the AD clones, they 
also contain an identical or neariy identical 570 bp sequence as described for 
AD7C-NTP. The 5 FB cDNAs all hybridized to 3.2 kB and 1.9 kB mRNA 
transcripts in developing rat brains and PNET cell lines. However, the 
cDNAs exhibited different degrees (intensities) of hybridization with mature 
brain, and variability with respect to the number (between 1 and 3) of low 
molecular weight (0.8-1.2 kB) mRNA transcripts detected in immature brain 
and PNET cells. Sequence data analysis suggests that each of the five FB 
clones corresponds with at least one of the cDNAs isolated from the AD brain 
library. 

Example 14 

The biological functions of NIT with respect to developmental regulation 
and cell growth in the CNS 

a. NTP expression is developmentally regulated 

Studies with human brain tissue suggested that NTP expression was 
developmentally regulated. Using the Th9 MoAb to FTP, and ["SJUTP- 
labeled cRNA probes generated with the rat PTP cDNA, NTP gene expression 
was examined in developing and mature rat brains. By in situ hybridization, 
NTP mRNA expression was detected throughout the CNS at embryonic day 
13 (E13). The density of hybridization grains (levels of mRNA expression) 
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increased throughout development and peaked on postnatal day 8 (P8). By 
P16, NTP mRNA expression was low-level and similar to adult brains. With 
regard to NTP immunoreactivity t faint widespread labeling of neuropil fibers, 
and intense focal labeling of ependymal lining cells were observed in E13 
brains. Thereafter, was a rostral-to-caudal wave of neuronal perikaryal NTP 
gene expression, such that olfactory structures were mainly labeled in E15 and 
E17 brains, while cerebellar cortical neurons were primarily labeled in P8 and 
P16 brains. Young adult and aged (>15 mos.) rat brains exhibited low, 
virtually non-detectable levels of NTP immunoreactivity in scattered cerebral 
cortex neurons. 

b. Quantitative assessment of NTP expression in developing rat brain 
using AD7c-NTP polyclonal antibodies 

Inununohistochemical staining studies using AD7c-NTP polyclonal 
antibodies yielded results similar to those obtained with PTP polyclonal 
antibodies. The greater specificity of AD7c-NTP antibodies for brain 
permitted Western blot analysis and quantitation of NTP expression. Western 
blot analysis disclosed 6 different size NTP-related proteins in rat brain: 15 
kD, 17 kD, 21 kD, 26 kD, 39 kD, and 42 kD. Densitometry scanning of the 
autoradiographs revealed progressive declines in the levels of several NTP 
proteins with increasing age. Importantly, like human brain, the 21 kD NTP 
molecules were expressed at high levels during development, and at low levels 
in the mature brain. The same was true for the 17 kD and 39 kD species. In 
contrast, levels of the 26 kD NTP molecules increased with age, while 
expression of the 42 kD species did not appear to be developmental^ 
regulated. 
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c. Aberrantly increased NTP expression in Down syndrome occurs prior 
to the establishment of AD histopathotogy and dementia 

In both control and Down syndrome 19-36 week fetus and infant 
brains, NTP immunorcactivity was widely distributed in neurons, neuropil 
5 fibers, and axons. In control brains, the density of NTP-immunoreactive 

neurons and the intensity of neuropil fiber labeling reduced substantially 
within the first decade, generally by 5 or 6 years of age. Thereafter, NTP 
immunoreactive expression remained low-level in all age groups. Adjacent 
histological sections were immunostained with a cocktail of MoAbs to 

10 neurofilament (SMI31 + SMI32 + SMI34) to delineate the frequency of AD 

lesions. None of the control brains had neurofibrillary tangles or dystrophic 
dendrites, but one elderly control had scattered neurofilament immunoreactive 
plaques. In Down syndrome, neurofilament immunoreactive superficial 
cortical dystrophic neurites (dendrites) developed and proliferated during early 

15 childhood, probably representing one of the earliest histopathological 

manifestations of AD neuronal degeneration. Neurofibrillary tangles were 
first detected between ages 10 and 20 years, while 
neurofilament-immunoreactive plaques were initially detected in the fourth 
decade of life. 

20 In developing Down syndrome brains, NTP immunoreactive expression 

in neuronal perikarya and neuropil fibers increased along with the large-scale 
proliferation of superficial neurofilament-immunoreactive dystrophic dendrites, 
and the appearance of neurofibrillary tangles. With increasing age and 
evolution of AD lesions, NTP expression further increased in Down 

25 syndrome. The increased NTP immunorcactivity was not restricted to neurons 

with neurofibrillary tangles or granuole vacuolar degeneration, nor was it 
localized in plaques. Increased NTP gene expression in Down syndrome 
brains begins at least two decades prior to the establishment of clinical and 
histopathological AD. Thus NTP up-regulated gene expression is an early 

30 marker of AD neuronal degeneration, age and development of AD lesions. 
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d. Demonstration of distinct NTP molecules in developing and mature 
human brains 

The molecular sizes of the NTP molecules expressed in brain were 
determined by SDS-PAGE fractionation of 100 /xg samples of protein, 

5 followed by radioimmunoassay (M-IRMA) of proteins eluted from gel slices- 

The results were graphed with respect to distances migrated by simultaneously 
analyzed molecular weight standards. Regardless of age, the dominant NTP 
species detected in brain was 21 kDa. In AD, with or without underlying 
Down syndrome, small 39 kDa and 26 kDa NTP peaks were also detected. 

10 In both control and Down syndrome infant brains, a prominent 17 kD NTP 

peak was also detected. 

e. Development of an in vitro model to examine NTP expression during 
growth and differentiation 

Of the human primitive neuroectodermal tumor (PNET) cell lines- 
15 PNET1, PNET2, and SH-Sy5y-PNET1 and PNET2 were of CNS PNET 

origin, while SH-Sy5y cells were derived from a neuroblastoma. All primary 
human PNET brain tumors have been found to express NTP. The 3 PNET 
cell lines used were demonstrated to have neurofilament, synaptophysin, and 
GAP-43 immunoreactivities, confirming their neuronal phenotypes. In 
20 addition, PNET2 and SH-Sy5y cells undergo neuronal differentiation with 

neuritic sprouting, increased synaptophysin expression, and decreased 
vimentin expression following treatment with retinoic acid, insulin, or phorbol 
esther myristate (PMA). In contrast, the PNET1 ceils are highly primitive, 
and fail to exhibit growth factor mediated cell growth or differentiation. 

25 /. Characterization of NTP expression in PNET cells 

In all three PNET cell lines, five different NTP species with Mr's of 
39-42 kD, 26 kD, 21 kD, 18 kD, and 15 kD were detected by (1) direct 
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Wcstcrn blot analysis, (2) metabolic labeling followed by 
immunoprecipitation, or (3) radioimmunoassay (M-IRMA). NTP gene 
expression was detected using Th polyclonal or monoclonal antibodies to FTP. 
Unlike adult human brain where the dominant NTP species detected was 21 
5 kD, in PNET cells, the 39 kD, 18 kD, and 15 kD NTP molecules were most 

abundant, while the 21 kD and 26 kD were expressed at low or non-detectable 
levels. Pulse-chase and metabolic labeling studies demonstrated that the 18 
kD and 26 kD species were probably derived from other NTP molecules. 
Additional studies demonstrated phosphorylation of the 39 kD, 26 kD, 21 kD, 

10 and 18 kD NTP molecules. Moreover, tyrosyl phosphoiyiated residues were 

detected in the 39 kD and 18 kD NTP molecules by Western blot analysis of 
the immunoprecipitated proteins. Finally, after stimulation with either PMA 
or insulin, a rapid supershifts in NTP molecular mass from 15 kD to 18 kD 
with incorporation of P J P) orthophosphate as observed in SH-Sy5y and 

15 PNET2 cells. Glycosylation of NTP has not been detected. Therefore, at 

least some of the NTP molecules are likely to be phosphoproteins. 

35 S-Met-labeled NTP molecules were immunoprecipitated using PTP 
Th MoAbs. Rapid labeling of the 39 kD, 21 kD, and 15 kD proteins, with 
subsequent appearance (within 10-30 min) of 26 kD, and 18 kD NTP species 

20 occurred. Thus some NTP molecules can be derived rather than synthesized 

de novo. 

g. Insulin modulation of NTP expression 

Insulin is an important mediator of growth and differentiation in CNS 
neurons. Insulin stimulated differentiation of PNET2 cells was associated with 
25 rapid (within 10 minutes) but transient increases in the levels of the 39 kD, 

18 kD and 15 kD NTP species, followed by sustained increases in synthesis 
and steady state levels of all five NTP species. In contrast, fee failure of 
insulin to induce differentiation of PNET1 cells was associated with absent 
insulin modulation of NTP. Analysis of the signal transduction pathways 
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demonstrated that the insulin-induced up-regulation of NTP molecules in 
PNET2 cells was mediated through phosphorylation of the insulin receptor 
substrate- 1 (IRS-1) and the insulin receptor 0 subunit (TRfls) itself. In PNET1 
cells, the lack of insulin responsiveness was associated with impaired 
insulin-mediated tyrosyl phosphorylation of IRS-1 , but normal insulin receptor 
phosphorylation. Correspondingly, the insulin-stimulated association between 
PI3 kinase and phosphorylated IRS-1 was also impaired in PNET1 cells. In 
essence, impaired insulin-mediated tyrosyl phosphorylation of IRS-1 inPNETl 
cells halted activation of the insulin signal transduction cascade, and 
subsequent events leading to modulated gene (NTP) expression. PNET1 cells 
lacked insulin responsiveness and failed to phosphorylate IRS-1, but insulin 
receptor levels and tyrosyl phosphorylation (PY) of the 0-subunit were intact. 
PNET2 cells responded to insulin stimulation with phosphorylation of IRS-1, 
up-regulation of NTP, and neuronal differentiation. The results were 
confirmed by absent association between PI3 kinase and IRS-l-PY in PNET1 
cells after insulin stimulation. 

h. Phorbol esther myristate (PMA) and retinoic acid (RA) modulate NTP 
expression and neuronal differentiation 

PMA and RA induced PNET2 and SH-Sy5y cells to differentiate into 
neurofilament-positive, GAP-43-positive, vimentin-negative cells with fine 
interconnecting neuritic processes. Following PMA stimulation, there was 
immediate phosphorylation of the 15 kD NTP species, with a supershift in 
molecular mass to 18 kD. After 12 hours of PMA or RA stimulation, 
synthesis of the 21 kD and 26 kD NTP species increased four- to five-fold ( 
followed by intracellular accumulation of these same molecules. At the same 
time, housekeeping gene expression, e.g. GAPDH, was not affected. PMA 
and RA treatment also resulted in a shift from the perikarya to neuritic process 
localization of NTP immunoreactivity. Since the effects of PMA stimulation 
were mimicked by phosphatidylserine plus diolein treatment, and blocked by 
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inhibitors of protein kinase C, expression of the 21 kD and 26 kD NTP 
species can be modulated through the protein kinase C cascade. 

L In vitro stimulation studies 

Neuritic sprouting and neuronal differentiation were induced in PNET2 
and SH-Sy5y cells by insulin, PMA, or RA stimulation- Insulin-mediated 
neuritic growth was associated with increased expression of the fetal brain and 
PNET-dominant forms of NTP (15 kD and 18 kD). In contrast, the PMA- 
and RA-induced neuritic sprouting modulated expression of the 21 kD and 26 
kD NTP species, which are primarily expressed in the mature brain, and 
accumulated in AD brains. Thus, expression of the immature or fetal forms 
of NTP are regulated by mechanisms and growth factors distinct from those 
involved in modulating expression of the 21 kD and 26 kD NTP molecules. 
Therefore, expression of fetal NTP molecules/genes can be mediated through 
the IRS-1 cascade, whereas expression of adult brain/AD-associated NTP 
genes can be regulated mainly through protein kinase C pathways. 

Example 15 

AD7c-NTP gene expression in postmortem bruin tissue from a large 
number of patients with AD, other neurodegenerative diseases, e.g. 
Parkinson's Disease, and no neurological disease (aged controls) 

a. Source of tissue 

Matched snap-frozen, and adjacent formalin-fixed paraffin-embedded 
blocks of cerebral tissue from different brain regions (Table 2) is used to 
analyze AD7c-NTP gene expression. All specimens are obtained from the 
Alzheimer's Disease Research Center (ADRC) Brain Bank located at the 
Massachusetts General Hospital (MGH). Additional fresh tissue is continually 
harvested by the ADRC Brain Bank. The histopathological sections are 
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reviewed for all cases. Routine neuropathologies evaluation includes luxol 
fast blue-hematoxylin and eosin, Bielchowsky silver, and Congo red staining, 
and ubiquitin and neurofilament immunostaining to detect neurodegenerative 
lesions in paraffin-embedded sections. 

TABLE 2: Postmortem Samples for AD7c-NTP Gene Expression Studies 



10 



Diagnosis 



Total Number of Cases 
Frozen Tissue* Paraffin Blocks** Ventricular Fluid 



AD 


50 




50 


50 




PD 


10 




10 


5 




DLBD 


8 




10 


0 




Pick's 


4 




8 


2 




ALS 


0 




4 


0 




Down + AD 


7 




8 


4 




Aged control 


50 




50 


50 





15 



Frozen Tissue Ventricular Fluid 
Assays 

RNAse Protection or RT/PCR 
Western blot analysis 
M-IRMA 



Paraffin Tissue Section Studies 

In situ hybridization 
Immunohistochemistry 



20 



* Brodrnann Areas: 21, ll t 40, 17; amydgala; midbrain, striatum, 
cerebellar cortex, s. cord. 

** Brodrnann Areas: 21, 11, 24, 40, 17; amygdala, hippocampus, 
midbrain, cerebellum, s. cord. 



Tissue Processing 



25 



Frozen tissue blocks (approximately 2 x 2 x 0.5 cm) are divided for 
RNA and protein extraction. RNA is extracted by the Chomczynski-Sacchi 
one-step guanidinium isothiocyanate/phenol method (Chomczynski & Sacchi, 
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Anal. Biochem. 162: 156-159 (1987)) using a commcrciaUy available reagents, 
e.g. RNAzol or TRIzoL The integrity of RNA is assessed by Northern blot 
analysis using glyceraldehyde-3-phosphate dehydrogenase (GAPDH) cDNA, 
and 18s synthetic 30mer (Enoch, T. era/., Mol Cell Biol 6:801810 (1986)) 
probes. Completely degraded samples are not used. Protein assays are 
performed with tissue homogenates prepared in 5 volumes of PBS plus 
protease inhibitor (Sasaki, Y. et a/., 7. Biol Chem. 265:3805-3808 (1993)). 
The supernatant fractions obtained by centrifugation at 12,000 x g for 30 
minutes at 4°C r and pelleted proteins solubilized in 1 % SDS-containing buffer 
are used for Western blot analysis and M-IRMA. Previous studies 
demonstrated that a substantial portion of thread proteins can be contained in 
the pellet fractions due to reduced solubility, particularly in AD brains. 
Protein content is determined by the Lowry (Lowiy, O.H. etal. t J. Biol 
Chem. 193:265-275 (1951)) or BioRad colorimetric assay. Paraffin-embedded 
histological sections of tissue adjacent to the blocks processed for the 
quantitative RNA and protein studies, are used for immunohistochemistiy and 
in situ hybridization studies. 

c. Western blot analysis 

Western blot analysis is employed to determine the size and relative 
abundance of the AD7c-NTP proteins present in brain homogenates. 
Polyclonal AD7c-NTP antibodies is utilized in these studies in order to 
simultaneously detect all NTP species. Both supernatant (soluble) and pellet 
(insoluble) fractions are analyzed. 100 /ig samples of protein is fractionated 
in Laemmli SDS-PAGE gels, transferred to Imobilon (nylon) or ECL Hybond 
membranes, and probed for AD7c-NTP expression using rabbit polyclonal 
antibodies generated to the recombinant fusion protein (Harlow & Lane, 
Antibodies. A Laboratory Manual Cold Spring Harbor Laboratory (1988) 
Cold Spring Harbor, NY). Antibody binding is detected with horseradish 
peroxidase conjugated goat anti-rabbit IgG, and enhanced chemiluminescence 
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reagents (Amersham). Pre-stained molecular weight standards is included on 
each gel. In addition. Western blots generated with different amounts 
(between 0.1 fig and 10 fig) of purified recombinant AD7c-NTP protein in 
each lane, is probed simultaneously to provide a basis for normalizing data 
5 among different experiments. The autoradiograms are subjected to volume 

densitometric scanning to quantitate the steady-state levels of AD7c-NTP 
protein expressed. 

d. lmmunohistochemical Staining 

Paraffin sections (Table 2) are de-waxed in xylenes and re-hydrated 
10 through graded alcohol solutions. AD7c-NTP immunoreactivity is assessed 

using rabbit polyclonal antibodies generated to the recombinant fusion protein. 
Antibody binding is detected by the avidin-biotin horseradish peroxidase 
complex (ABC) method using the Vectastain-Elite kit according to the 
manufacturers protocol. Immunoreactivity is revealed withdiaminobenzidine, 
15 The sections are counterstained with hematoxylin, dehydrated through graded 

alcohols, cleared in xylenes, and preserved under coverglass with pennount. 
The sections are processed in large groups, adhering rigidly to the incubation 
conditions. The results are analyzed under code to determine the presence and 
distribution of AD7c-NTP immunoreactivity. 

20 e. In situ hybridization 

Cellular localization of AD7C-NTP gene expression is assessed by in 
situ hybridization using paraffin-embedded tissue (Table 2). The sections are 
prepared and prehybridized. Antisense and sense (negative control) 
digoxigenin-UTP labeled cRNA probes are generated from AD7c-NTP 
25 plasmid linearized with Kpnl or Xhol , using T7 or SP6 DNA-dependent RNA 

polymerase, respectively. The probes are gel purified to remove free 
nucleotides. The sections are hybridized overnight at 50°C with 200 ng/ml 
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of probe (Lee. M.-E. etal., J. Clin. Invest. 86: 141-147 (1990). After 
extensive washing and RNAse A digestion to destroy single-stranded RNA. 
hybridized probes are detected with alkaline phosphatase conjugated 
anti-digoxigenin, and the antibody binding arc revealed with a suitable 
5 chromogen, e.g. BCIP/NBT. Sections counterstained with hematoxylin and 

preserved with aqueous mounting medium are examined and photographed by 
brightfteld and darkfield microscopy. Adjacent sections are evaluated for 
intactness of RNA by performing in situ hybridization with cRNA probes 
corresponding to GAPDH. 

10 /. RNAse Profection Assay 

RNAse protection assays are used to examine levels of AD7c-NTP 
mRNA expression because, compared with Northern blot analysis, this 
technique is relatively insensitive to the small degrees of RNA degradation 
which frequently exist in postmortem tissue. Samples of 5 or 10 jtg of total 

15 RNA are hybridized with gel purified antisense [ 31 P]UTP-labeled cRNA 

probes (Current Protocols in Molecular Biology. Ausubel et al. Eds., John 
Wiley & Sons, New York, 1994) corresponding to nucleotides 670 to 910 of 
the AD7c-NTP cDNA. As a positive control, the same samples are 
simultaneous hybridized (in the same assay tubes) with identically labeled 316 

20 bp antisense cRNA probes corresponding to exons 5-8 of the GAPDH gene 

(Sabath, D. etal. Gene 91:185-191 (1990)). After hybridization, 
single-stranded RNA are digested with RNAse A and RNAse Tl (Current 
Protocols in Molecular Biology. Ausubel et al. Eds., John Wiley & Sons, 
New York, 1994, and the protected probe fragments analyzed on denaturing 

25 polyacrylamide gels. (RNA and human infant brain RNA are used as negative 

and positive controls, respectively. Non-saturated autoradiograms are 
subjected to volume densitometric analysis to quanthate the hybridization 
signals. 
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g. Alternative method for examining levels of AD7c-NTP mRNA 
Expression 

Reverse transcription/polymerase chain reaction amplification of RNA 
has been successfully used to study gene expression. The advantage of 
5 RT/PCR over RNAse protection is that multiple genes can be studied 

simultaneously, utilizing only 1 fig of total RNA as starting material. Low 
yields of RNA have been problematic in the past, particularly with respect to 
AD brain tissue. RNAse protection assays arc performed on samples with 
abundant RNA. yields, and RT/PCR are performed with all samples. 

10 Messenger RNA are reverse transcribed using oligo-dT and random 

oligonucleotide primers. The cDNAs are amplified with primers that flank the 
sequences contained between nucleotides 670 and 910 of AD7c-NTP. The 
results are analyzed using 1-3% Nusieve agarose gels and ethidium bromide 
staining. In addition, to confirm the authenticity of PCR products, Southern 

15 blot analysis is performed using either [ 3J PJdATP or [fluorescein]dATP 

end-labeled oligonucleotide probes corresponding to internal sequences of the 
amplified DNA segment. The minimum number of PCR cycles required to 
detect AD7c-NTP is determined to ensure the amplified products do not reach 
saturation. 



20 h. Construction of a two- or three-site monoclonal antibody based 

immunoradiometric assay (Af-IRMA) to measure AD7c-NTP 
concentration 

Two- or three-site forward sandwich M-IRMAs are used to measure 
AD7c-NTP concentrations. An assay using the #5 and #2 antibodies which 
25 specifically recognize NTP (not PTP) exhibits greater degrees of binding to 

AD compared with aged control brains by immunohistochemical staining. The 
configuration of the prototype two-site M-IRMA is as follows: the #2 MoAb 
serves as the capture antibody bound to a solid-phase support (0.25" 
polystyrene beads). After incubating the coated beads with brain tissue 
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homogcnatcs or CSF t the captured AD7c-NTP proteins arc detected with 
"^-labeled #5 MoAb, which serves as a tracer. The radioactivity remaining 
on the beads after extensive washing is measured in a gamma counter. The 
signal-to-noise ratios arc calculated, and the concentrations of AD7c-NTP in 

5 the samples, determined from a linear standard curve constructed with 

different amounts of recombinant AD7c-NTP protein. Immunoreactivity is 
measured in 200 /xl volumes of diluted tissue extract, CSF, or serum. The 
lower limit of sensitivity is between 1 and 10 pg per ml of purified 
recombinant AD7c-NTP protein. 

10 Brain protein extracts corresponding to the supernatant (soluble) and 

pellet (insoluble) fractions, and samples of postmortem ventricular fluid are 
assayed in quadruplicate at 1:10, 1:50, and 1:100 dilutions to generate S:N 
ratios within the linear range of the standard curve. The AD7c-NTP protein 
concentrations are measured in brain tissue by M-IRMA, using the forward 

15 sandwich assay described. Other configurations with one- or two-site MoAb 

capture are also used. 

L Characterization of additional anti-AD7c-NTP MoAbs 



The M-IRMA is optimized in terms of sensitivity and specificity for 
detecting NTP molecules that accumulate in AD brain tissue. The hybridomas 

20 are screened by evaluating the extent of immunorcactive binding in solid phase 

support immunoassays, Western blot analysis, immunohistochemical staining, 
as described above with the first 25 AD7c-NTP MoAbs. For the solid phase 
support immunoassay, 50 ng of recombinant AD7c-NTP protein are bound to 
polypropylene surfaces in 96-well plates. Hybridoma supernatant are reacted, 

25 and antibody binding detected using l25 Mabeled goat anti-mouse IgG. 

Specificity for AD7c-NTP is assessed by demonstrating absent binding to 
purified PIT, and significant binding above background levels generated with 
nonrelevant hybridoma supernatant. Indirect Western blot analysis are 
performed using recombinant AD7c-NTP protein and human AD brain 
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homogenates, and immunorcactivity is detected with ECL reagents. Western 
blot analysis permits rapid comparison of the relative sizes of the recombinant 
protein with the AD7c-NTP-related molecules expressed in brain, 
Immunohistochemical staining of Brodmann Area 1 1 in the frontal lobe using 
neet hybridoma supernatant is performed to demonstrate the distributions and 
relative levels of AD7c-NTP MoAb immunorcactivity in corresponding 
sections of AD and control brains. Immunohistochemical screening permits 
the identification of AD7c-NTP MoAbs with specific immunoreactivity in 
neurons, and greater degrees of binding in AD compared with aged control 
brains. 

Example 16 

Use of M-IRMA to compare the levels of AD7c-NTP protein in AD, aged 
control, and neurological disease control samples of postmortem 
ventricular fluid and antemortem CSF 

a. Source of specimens 

Postmortem ventricular fluid samples from patients with AD, PD, 
Down syndrome, and normal aging have been obtained from the ADRC-MGH 
brain bank (Table 2). Approximately 400 clinical cerebrospinal fluid (CSF) 
samples (Table 3) from patients with AD f PD, multiple sclerosis, 
non-dementing psychiatric disease (controls), minor neurological ailments, e.g. 
back pain (controls), or multi-infarct dementia have also been banked. These 
samples represent discarded clinical material from previously approved 
studies, or samples obtained for diagnostic purposes. The AD and PD 
samples were obtained from patients enrolled in the ADRC, and therefore 
long-term clinical follow-up, including autopsy in many instances, is available. 
The psychiatric case samples were obtained from patients hospitalized at the 
Bedford VA Hospital. These samples also represent discarded specimens 
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from previously approved clinical studies, and long-term clinical follow-up 
data has already been obtained and incorporated into a computerized database. 

TABLE 3: Clinical CSF Samples for M-IRMA 

Di agnosis No. of Cases Dementia 

5 AD 
PD 

Multiple Sclerosis 
Multi-infarct dementia 
Psychiatric 
10 Control 



154 
56 
75 
5 

28 
94 



Yes 
Yes 
No 
Yes 
No 
No 



M-IRMA Assay to measure AD7c-NTP in ventricular fluid and CSF 
samples 

The samples are diluted 1:10, 1:50, and 1:100 in PBS/BSA, and 
assayed in quadruplicate for AD7c-NTP as described above. Samples yielding 
15 results with signal:noise (S:N) ratios that do not fall within the linear range 

of the standard curve generated with purified recombinant AD7c-NTP protein 
are re-assayed at higher or lower dilutions as required. Results are expressed 
with respect to volume and protein concentration. 

c. Analysis of the size of AD7c-NTP present in ventricular fluid and 
20 CSF 

The molecular size of the AD7C-NTP molecules present in CSF or 
ventricular fluid is determined by Western blot analysis. These studies 
determine whether the levels of the 21 kD or another NTP species are 
increased in AD compared with aged control CSF and ventricular fluid 
25 specimens. Samples containing 100 /ig of protein are fractionated in Laemmli 
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SDS-PAGE gels with molecular weight standards. For comparison with 
results obtained using polyclonal antibodies, the Western blots are re-probed 
with the tracer (detection) AD7c-NTP MoAb used in the M-IRMA. 
Additional planned efforts to generate MoAbs that bind to specific and unique 
NTP molecules will abrogate the need to perform Western blot analysis, in 
order to interpret abnormally elevated levels of AD7c-NTP. Complete 
analysis of samples will be possible with a series of rapid and simple 
M-IRMAs. 

Example 17 
Analysis of levels ofAD7c-NTP in serum of AD 

Approximately 100 AD anlemortem serum samples, representing 
discarded specimens from previously approved clinical studies, are available. 
In addition, approximately 250 samples of serum from normal individuals, 
obtained from previously approved, unrelated studies are also available. 
Finally, postmortem serum from patients with confirmed AD has been banked 
by the ADRC-MGH, and is available. Studies are conducted to determine 
whether AD7c-NTP molecules are detectable in serum by Western blot 
analysis. Although some of MoAbs generated to recombinant AD7c-NTP 
cross-react with PTP, which is abundantly present in serum, the antibodies 
selected for M-IRMA exhibit no detectable binding to purified PTP at 
concentrations as high as 1 mg/ml. The objective is to determine whether 
AD7c-NTP can be measured in serum samples, and whether elevated levels 
of AD7c-NTP detected in postmortem ventricular fluid, brain tissue, or CSF 
are detectable in paired serum samples from the same individuals. Another 
objective is to determine whether the levels of AD7c-NTP are elevated in sera 
from probable and definite AD patients for whom CSF samples are not 
available. Sera diluted 1:10, 1:50, 1:100, and 1:250 in PBS/BSA are assayed 
for AD7c-NTP by M-IRMA. 
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Example 18 

Additional AD and fetal brain 
NTPcDNAs 



Data suggest that the NTP molecules expressed in such non-AD related 
5 diseases, arc 15 kD or 17 kD, rather than 21 kD in size. A family of NTP 

cDNAs was isolated from AD brain, and 17 week human fetal brain (FB) 
libraries. Analysis of the nucleic acid sequences indicates clear regions of 
extreme homology or identity, as well as unique domains in each clone. 

Both strands of each clone are sequenced by the dideoxynucleotide 

10 chain termination method using T7 polymerase and custom oligonucleotide 

primers, with incorporation of deazo-G analogue nucleotides to help 
circumvent compression artifacts. The nucleic acid and amino acid sequences 
are compared with the AD7c-NTP cDNA, and with the published sequences 
of non-neural thread proteins (FTP and HIP). 

15 mRNA expression of the different NTP cDNAs is examined using 

RT/PCR technology with primers designed to amplify 200 to 500 bp unique 
sequences contained within each cDNA. cDNA fragments amplified from 
plasmid templates are directionally subcloned into pGEM vectors to generate 
digoxigenin-labeled cRNA probes for in situ hybridization studies. To quickly 

20 assess potential function, RT/PCR and in situ hybridizations arc conducted on 

a battery of test samples (Table 4). NTP gene expression is quantitated by 
Southern blot analysis of PCR products, relative to the levels' of 
simultaneously amplified GAPDH. 
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TABLE 4: Samples for Surveying Expression of Different NTT Genes 



Mature Brain 


Developing Brain 


PNET2 and SH-SySy cells 


1. AD (N«4) 


1. 2nd trimester fetus 


1. overnight serum starved 


(N-2) 




2. normal aging (N=4) 


2. 3rd trimester fetus 


2. proliferating 




(N-2) 




3. Acute stroke + intact 


3. infant (N=2) 


3. retinoic acid-induced 


tissue (N«3) 




differential 


4. Remote stroke 4- intact 


4. 5 year old (N=2) 


4. vehicle-treated negative 


tissue (N=2) 




control 




5. 16 year old (N=l) 





Methods to Examine NTP Gene Expression: 



RNA Studies 

In situ hybridization 
RT-PCR or RNAse Protection 

Protein Methods 

Immunocytochcmistry 
Western Blot analysis 

Metabolic Labeling/Immunoprecipitetion (cell culture studies only) 



The PCR amplified cDNA fragments are subcloned into pTrc-His 
vectors (InVitrogen) to generate recombinant proteins for immunization and 
analysis of immunoreactivity. The recombinant proteins are purified by metal 
chelate affinity chromatography. The purified and concentrated (Centricon 
filter) recombinant fusion proteins are used to generate monoclonal and 
polyclonal antibodies for examining protein expression in brain and PNET cell 
lines. 

DNA sequence analysis suggests a need to generate polyclonal and 
MoAbs to 3 or 4 distinct recombinant truncated fusion proteins to distinguish 
among the different NTP molecules expressed in brain and PNET cell lines. 
Polyclonal antibodies are generated first and used to study gene expression. 
Rabbits are immunized with purified AD-NTP or FB-NTP truncated proteins 
(described above) (Harlow & Lane, Antibodies. A Laboratory Manual, Cold 
Spring Harbor Laboratory (1988) Cold Spring Harbor, NY). Pre-immune 
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senim, and senim obtained after the second boost are assayed for 
NTP-specific immunoreactivity by Western blot analysis and solid phase 
immunoassay using different amounts of recombinant protein, and with 
AD7c-NTP, as well as the other NTP truncated proteins as negative controls. 
5 The resulting polyclonal antibodies are used to examine the levels and cellular 

and tissue distributions of immunoreactivity by Western blot analysis and 
inununocytochemistry using a battery of testing materials (Table 4). 

All publications and patent applications mentioned in this specification 
are indicative of the level of skill of those in the art to which the invention 
10 pertains. All publications and patent applications are herein incorporated by 

reference to the same extent as if each individual publication or patent 
application was specifically and individually indicated to be incorporated by 
reference in their entirety. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: The General Hospital Corporation 

(ii) TITLE OF INVENTION : Neural Thread Protein Gene Expression and 

Detection of Alzheimer's Disease 

(iii) NUMBER OF SEQUENCES: 121 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Sterne, Kessler, Goldstein & Fox P.L.L.C. 

(B) STREET: 1100 New York Avenue, Suite 600 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A. 
<F) ZIP: 20005-3534 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: (to be assigned) 

(B) FILING DATE: 14 -NOV- 1995 

(C) CLASSIFICATION: 

(vii) PRIORITY APPLICATION DATA: 

<A) APPLICATION NUMBER: 08/340,426 

(B) FILING DATE: 14 -NOV- 1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Jorge A. Goldstein 

(B) REGISTRATION NUMBER: 29,021 

(C) REFERENCE/DOCKET NUMBER: 0609.384PC02 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-2600 

(B) TELEFAX: (202) 371-2540 



(2) INFORMATION FOR SEQ ID NO:l: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
CCGATTCCAA CAGACCATCA T 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
CCAACAGACC ATCATTCCAC C 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CCAAACCGAT TCCAACAGAC C 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
CCTGGGCAAC AAGAGCGAAA ACTCCATCTC 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
ATCGCTTGAA CCCGGGAGGC GGAGGTTGCG 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D) TOPOLOGY: both 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
GGGGAGGCTG AGGCAGGAGA ATCGCTTGAA 



(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 



W 96/15272 



PCT/US95/17111 



-113- 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: both 
(D) TOPOLOGY: both 



FEATURE: 
(A) NAME/KEY: miec_feature 

(C) OTHER INFORMATION: /label« misc_feature 

/note- B N M represents a nucleotide linkage of 0-10 bases which 
links nucleotide "C" at position 21 and nucleotide "C* at position 23. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TACTACCAGA CAACCTTAGC CNCCGATTCC AACAGACCAT CAT 43 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ix) FEATURE: 

(A) NAME/KEY: raisc_feature 

(C) OTHER INFORMATION: /label « misc_feature 

/note- "N" represents a nucleotide linkage of 0-10 bases which 
links nucleotide "T" at position 21 and nucleotide n T" at position 23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CCGATTCCAA CAGACCATCA TNTACTACCA GACAACCTTA GCC 43 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 



WO 96/15272 



PCT/DS95/17111 



-114. 



(D) TOPOLOGY: both 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(C) OTHER INFORMATION: /label* mis c_f eature 

/notes "N° represents a nucleotide linkage of 0-10 bases whi< 
links nucleotide. "C at position 21 and nucleotide "C" at position 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
CCACCTTACT ACCAGACAAC CNCCAACAGA CCATCATTCC ACC 43 



12) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic. acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(C) OTHER INFORMATION: /labels misc_f eature 

/note= "N" represents a nucleotide linkage of 0-10 bases whic 
links nucleotide "C" at position 21 and nucleotide "C" at position 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCAACAOACC ATCATTCCAC CNCCACCTTA CTACCAGACA ACC 43 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(C) OTHER INFORMATION: /label- misc_feature 
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/note- "N" represents a nucleotide linkage of 0-10 bases which 
links nucleotide "C" at position 21 and nucleotide "C" at position 23. 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11; 
CCAGACAACC TTAGCCAAAC CNCCAAACOG ATTCCAACAG ACC 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: both 
(D) TOPOLOGY: both 

(ix) FEATURE : 

(A) NAME /KEY: misc_f eature 

<C) OTHER INFORMATION: /labels misc_f eature 

/note» "N" represents a nucleotide linkage of 0-10 bases which 
links nucleotide "C" at position 21 and nucleotide "C" at position 23. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCAAACCGAT TCCAACAGAC CNCCAGACAA CCTTAGCCAA ACC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGCTGCGCCC AGGCTGGCTC TGGAAAGCCT GTGCGGTCCT GGCAGGAAGC CCGGCCCGTQ 
GAGCAGGTTT TCGTTCTGCT TCAGCAATAA ATAAGGGTOA CCACAGGGAC TTTGCTTTTG 



60 
120 
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GTTTCCTTTC CTGTGAAAAG GTTGGTTTTA AAGTGAGATA CACTTTTCCG TAGAACAAGT 180 

GTTCTATCTT TAAAAACCCA AATTGCAGCA CCGTGGATTA CTGGTCTCAG AACAACTCAT 240 

TCCGCATCAG ATTTGACTCT CTGATTTTCT GTCTATTGGC CAAATTGCCC TTTAACTGCA 300 

CCTGAATCCT TTGTGTACTG ATGCCTTTGA GCTGGGCACC TTGGGAGAGT GTTGTGTTGC 360 

TGTTTACGGT TCTTCCTTCC CCTTGCTAAT TACAGTCTCT GGTGCCCAGC AAGCCCCTTT 420 

GGCTTCCTTC CGTGACTGGT CACGTTGTCT GCCTGGGCTC AGCGTGGACC TGCCCCATGC 480 

TGCAGAACCT GGCCTCACCT GGACTTTTAC TAGAATTGCC AGCTTCTCAA CTTAGCAGAT 540 

CATCACTCAT GCGGGCACAA GCAAAGATCA ACACTTTCTT TTTTGGTAAG CTTGAGTTTT 600 

ACAAGTTATT TTTTGGTGAT GCGTAAGACA TTGCAGTGGG AAACCATTCA ACTTGAGTTT 660 

ATTGGAGTTT GCTGTTGTAG CAGGTTTTAA CTCAGGAACA ACTCTTGTCT GATCTCTCGC 720 

CCCTCTGCCG GGACTACATT ACTGTCTCTC GGAGCCGGTA GCGTTGCTGT CGAGTCCCAG 780 

GACTATCTCT GCAGACTGCT ATGCTCAGAT CGAAGTATTT CACAAGAATA CTTGTGTTTT 84 0 

TAACAGCCCT TCCCCTGGAC GGTGCGCCAT GAGGGCCTCA TGTTACGCAT TGCCTTTTCT 900 

TTCTGTGGAT CCAGTATCTT CCTCGGCTTT TTAGGGAGCA GGAAAAATGC GTCTGAGAGC 960 

AACTCTTTTT AAAAACCTGC CCTGTTGTAT ATAACTGTGT CTGTTTCACC GTGTGACCTC 1020 

CAAGGGGGTG GGAACTTGAT ATAAACGTTT AAAGGGGCCA CGATTTGCCC GAGGGTTACT 1080 

CCTTTGCTCT CACCTTGTAT GGATGAGGAG ATGAAGCCAT TTCTTATCCT GTAGATGTGA 1140 

AGCACTTTCA GTTTTCAGCG ATGTTGGAAT GTAGCATCAG AAGCTCGTTC CTTCACACTC 1200 

AGTGGCGTCT GTGCTTGTCC ACATGCGCTG GGCGTCTGGA CCTTGAATGC CTGCCCTGGT 1260 

TGTGTGGACT CCTTAATGCC AATCATTTCT TCACTTCTCT GGACACCCAG GGCGCCTGTT 1320 

GACAAGTGTG GAGAAACTCC TAATTTAAAT GTCACAOACA ATGTCCTAGT GTTGACTACT 1380 
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ACAATCTTGA TGCTACACTO TTCTAATTAT TAAACTGATT ATTTTTCTTA TGTCAAAAAA X440 
*AA 1443 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GATCCCGTTT GACAGGTGTA CCGCCCCAGT CAAACTCCCC ACCTGGCACT GTCCCCGGAG 60 

CGGTCGCGCC CGCGCGACCA CGGAGCTCTG GGCGCCAGAA GCGAGAGCCC CTCGCTGCCC 120 

CCCGCCTCAC CGGGTAGTGA AAAAACGATG AG AGTAGTG G TATTTCACCG GCGGCCCGCG 180 

AGGACCCCCG CCCGACCCAG TGCGGAACGG GGG 213 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3S8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ 3D NO: 15: 

CACCGTGGAT TACTGGTCTC AGAACAACTC ATTGCGCATC AGATTTACTC TCTGANTTTC 60 

TGTCTATTGG CCATTCCCCT TTAACTGCAC CTGAATCCTT TGTGTACTGA TCCTTTGAGC 120 

TGGGCACCTT GGGAGAGTGT TGTGTTGCTG TTTACGGTTC TTCCTTCCCC TTGCTAATTA 1B0 

CAGTCTCTGG TGCCAGCAAG CCCCTTTGGC TTCCTTCCGT GACTGGTCAC GTTGTCTGCC 240 
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TGGGCCAGCG TGGCCCCATG CTGCAGAACC TGGCCTCAGG ACTTTTCACT AGAATTGCCC 300 
TTCCTCAACT TAGCAGATCA TTCACTCATG CGGGCACAAG CAAAGATCAA CACTTTCT 358 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: lb: 

CATTGCAGCT CAGCATGGCT CAGACCAGCT CATACTTCAT GCTGATCTCC TGCCTGATGT 60 

TTCTGTCTCA GAGCCAAGGC CAAGAGGCCC AGACAGAGTT GCCCCAGGCC CGGATCAGCT 120 

GCCCAGAAGG CACCAATGCC TATCGCTCCT ACTGCTACTA CTTTAATGAA GACCGTGAGA 180 

CCTGGGTTGA TGCAGATCTC TATTG CCAG A ACATGAATTC GGGCAACCTG GTGTCTGTGC 240 

TNCCCAGGCC GAGGGTGCCT TTGTGGCCTC ACTGATTAAG GAGAGTGGCA CTGATGACTT 300 

CAATGTCTGG ATTGGCCTCC ATGACCCCAA AAAGAACCGC CGCTGGCACT GGAG CAGTGG 360 

GTCCCTGGTC TCCTACAA 378 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
TTTTCCTAGA ACAAGGGTTC TATCTTTAAA AACCCAAATT GCAGCACCGC TGGTCTCAGA 60 
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ACAACTCATT GCGCATCAGA TTTACTCTCT GATTTTTCTG TCTATTTGGC CAAATTGCCC 120 
TTTTAACTGC ACCTGAATCT TT 142 

(2) INFORMATION FOR SEQ ID NO: 16 : 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 151 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: both 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TTCTCCTATA GAGATTGTTG ATTTGCCTCT TAG CAAGAG A TTCATTGCAG CTCAGCATGG 60 

CTCAGACCAG CTCATACTTC ATGCTGATCT CCTGCCTGAT GTTTCTGTCT CAGAGCCAAG 120 

GTAAGATCTC TTTTCCACCA ACCAACTCTT T 151 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CACCGTGGAT TACTGGTCTC AGAACAACTC ATTGCGCATC AGATTTACTC TCTGATTTTT 60 
CTGTCTATTG GCCAA 75 

(2} INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 77 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



CATTGCAGCT CAGCATGGCT CAGACCAGCT CATACTTCAT GCTGATCTCC TGCCTGATGT 



60 



TTCTGTCTCA GAGCCAA 



77 



. (2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:21: 

GAATTCCTGG GCTCAAGTGA TCCTCTCATG CAGTCTCCCA AAGTGCTGGG ATGACAGGCT 60 

TGAGCCACCA CACCAGGCCC ATCATCAGTT TATATAAAGA AAAAAAAACC TTAAAATTGT 120 

TAGGCAAATA 130 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY.: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 22: 



GGAGTTTTGT CATCAGGCCA GCCTCATCCC GAGGTCTCCT CCACCATTGG CCGTAGCCAG 



60 
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CAGGTTCAGT GCTCACCGAA AGTAAAATCC CCTCCTTCAG CAAGAATAAA GCAATATACA 120 
CCTTAGGTTC CACTAAGTAA CATA 144 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
(DJ TOPOLOGY: both 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GTTCTGTGAG TCTCAATTTG TTCCTTCTTG GAAGCTGTCT GGTGAATCTG TTGGTCCCTC 
TGTCTGCTAT TCTGTCTGTC TGTATGTCTG TCCATG 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GTTGATTTGC CTCTTAAGCA AGAGATTCAT TGCAGCTCAG CATGGCTCAG ACCAGCTCAT 
ACTTCATGCT GATCTCCTGC CTGATGTTTC TGTCTCAGAG CCAAG 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
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(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

ACTACCAAAC CTGCATTAAA AAATTTCGGT TGGTCGACCT CGGAGCAGAA CCCAACCTCC 60 

GAGCAGTACA TGCTAAGACT TCACCAGTCA AAGCGAACGT ACTATACTCA ATTGATCCAA 120 

TAACTTGACC AACGGAACAA GTTACCCTAT AACAGCGCAA TCCTATTCTA GAGTCCATAT 180 

CAACAGGGTT TACGACCTCG ATGTTGGATC AGGAC 215 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

ACCACGGAGC TCTGGGCGCC AGAAGCGAGA GCCCCTCGCT GCCCCCCGCC TCACCGGGTA 60 

GTGAAAAAAC GATGAGAGTA GTGGTATTTC ACCGGCGGCC CGCGAGGACC CCCGCCCGAC 120 

CCAGTGCGGA ACGGGGGAGT AGTCCCGGGG GCTCACTTAT TCTACATTAG TCTCACGTGC 180 

AGACTAGAGT CAAGCTCAAC AGGGTCTTCT TTCCCGCTGA TTCCGCCAAG TC 232 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

<A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
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AGTTTCACTC TGTTGCCCAG GCTGGAGTGC AATGGCACAA TCCTGGCTCA CTGCAACCTC 60 
CGCCTCCCGA GCTCAAGCAA TTCTCCTGCC TCAGCCTCGT GAGCCGCTGG GA 112 

(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base paire 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
AGATCTCGCT CTGTCACCCA GGCTGAAGTG CAGTGGCCCA ATCTCGGCTC ACTGCGAGCT 60 
CCACCTCCCG GGTTCACTTC ATTCTCCTGC CTCACTGCCT CAGCCTCTGA GTAGCTGGGA 120 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 594 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GAGGCGTATT ATACCATGCT CCATCTGOCT ACGACAAACA GACCTAAAAT CGCTCATTGC 60 

ATACTCTTCA ATCAGCCACA TAGCCCTOGT AGTAACAGCC ATTCTCATCC AAACCCCCTG 120 

AAGCTTCACC GGCGCAGTCA TTCTCATAAT CGCCCACGGG CTTACATCCT CATTACTATT 180 

CTGCCTAGCA AACTCAAACT AOGAACGCAC TCACAGTCGC ATCATAATCC TCTCTCAAGG 240 

ACTTCAAACT CTACTCCCAC TAATAGCTTT TTGATGACTT CTAGCAAGCC TCGCTAACCT 300 

CGCCTTACCC CCCACTATTA ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACGTTCTC 360 
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CTGATCAAAT ATCACTCTCC TACTTACAGG ACTCAACATA CTAGTCACAG CCCTATACTC 420 

CCTCTACATA TTTACCACAA CACAATGGGG CTCACTCACC CACCACATTA ACAACATAAA 480 

ACCCTCATTC ACACGAGAAA ACACCCTCAT GTTCATACAC CTATCCCCCA TTCTCCTCCT 540 

ATCCCTCAAC CCCGACATCA TTACCGGGTT TTCCTCTTAA AAAAAAAAAA AAAA 594 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Glu Ala Tyr Tyr Thr Met Leu His Leu Pro Thr Thr Asn Arg Pro Lys 
15 10 15 

He Ala His Cys He Leu Phe Asn Gin Pro His Ser Pro Arg Ser Asn 
20 25 30 

Ser His Ser His Pro Asn Pro Leu Lys Leu His Arg Arg Ser His Ser 
35 40 45 

His Asn Arg Pro Arg Ala Tyr He Leu He Thr He Leu Pro Ser Lys 
50 55 60 



Leu Lys Leu Arg Thr His Ser Gin Ser His His Asn Pro Leu Ser Arg 
€S 70 75 80 



Thr Ser Asn Ser Thr Pro Thr Asn Ser Phe Leu Met Thr Ser Ser Lys 
85 90 95 



Pro Arg 
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<2) INFORMATION FOR SEQ ID NO:31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : both 
<D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ATACCATGCT CCATCTGCCT ACGACAAACA GACCTAAAAT CGCTCATTGC ATACTCTTCA 60 

ATCAGCACAT AGCCCTCGTA GTAACAGCCA TTCTCATCCA AACCCCCTGA AGCTTCACCG 120 

GCGCAGTCAT TCTCATAATC GCCCACGGGC TTACATCCTC ATTACTATTC TGCCANCAAA 180 

CTCAAACTAC GAACGCACTC ACAGTCGCAT CATAATCTCT CTCAAGGACT TCAAACTCTA 240 

CTCCCAAGCT TTGTGACTTC TAGCAACCTC GCTAACCTCG CCTTACCCCC ACTATTAACT 300 

ACTGGGAGAA TGTGCTAGTA ACCACGTTCT CCTTCAAATA TCACTCTCCT ACTTACAGGA 360 

CTCAACATAC TAGTCCAGCC CTATACTCCC TCTACATATT TACCACAACA CAATGGGCTC 420 

ACTCACCCAC CACATTAACC ATAAAACCCT CATTCACACG AGAAAACACC CTCATGTTCA 4 B0 

TACACCTATC CCCCATTCTT CCTATCCCTC AACCCCOACA TCAACCGGGT TTCCTCTTAA 540 

AAAAAAAAAA AAAA 554 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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ATGCCTATCG CTCCTACTGC TACTACTTTA ATGAAGACCG TGAGACCTGG GTTGATGCAG 60 

ATCTCTATTG CCAGAACATG AATTCGGGCA ACCTGGTGTC TGTGCTCACC CAGGCCGAGG 120 

GTGCCTTTGT GGCCTCACTG ATTAAGGAGA GTGGCACTGA TGACTTCAAT GTCTGGATTG 180 

GCCTCCATGA CCCCAAAAAG AACCGCCGCT GGCACTGGAG CAGTGGGTCC CTGGTCTCCT 240 

ACAAGTCCTG GGGCATTGGA GCCCCAAGCA GTGTTAATCC TGGCTACTGT GTGAGCCTGA 300 

CCTCAAGCAC AGGATTCCAG AAATGGAAGG ATGTGCCTTG TGAAGACAAG TTCTCCTTTG 360 

TCTGCAAGTT CAAAAACTAG AGGCAGCTGG AAAATACATG TCTAGAACTG ATCCAGCAAT 420 

TACAACGGAG TCAAAAATTA AACCGGACCA TCTCTCCAAC TCAACTCAAC CTGGACACTC 480 

TCTTCTCTGC TGAGTTTGCC TTGTTAATCT TCAATAGTTT TACCTACCCC AGTCTTTGGA S40 

ACCTTAAATA ATAAAAATAA ACATGTTTCC ACTAAAAAAA AAAAAAAAAA 590 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 

CTTCTTTTTC AGGCCAAGAG GCCCAGACAG AGTTGCCCCA GGCCCGGATC AGCTGCCCAG 60 

AAGGCACCAA CCTATCGCTC CTACTGCTAC TACTTTAATG AAGACCGCGA GACCTGGGTT 120 

GATGCAGTGT GAGTGAGGAG AGCGTGTGGG AAGGGAGACT CATGAAGGGA GGGGAAGCTG 180 

CCACTCTCCA GTGTTCAGTG GOGCAATGAG ATGAGACTGA ACCCCTTTAT ACTATCATCA 240 

GCCCCAAACT TTCCAATCTA CTTTATCCCA TTATTCAGCA CATTCCCAGC ACAAAGAACC 300 

TGGTGGGTGA CAGCATCATC ACGGACATTA CTCTGCTGTC CTTTTTCACC CTCCTCTTGG 360 



WO 96/15272 



PCTAJS95/1711I 



-127- 

AGGACTCAGT ATATCCGTCA CAACCCTCCA CTGAGTCTCC ATTTTCTTCT GCAACAGCTC 420 
TATTGCCAGA ACATGAATTC GGGCAACCTG GTGTCTGTGC TCACCC 466 

12) INFORMATION FOR SEQ ID NO: 34 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

CGTATTAIAC CATGCTCCAT CTGCCTACGA CAAACAGACC TAAAATCGCT CATTGCATAC 60 

TCTTCAATCA GCCACATAGC CCTCGTAGTA ACAGCCATTC TCATCCAAAC CCCCTGAAGC 120 

TTNCCGGCGC AGTCATTCTC ATAATCGCCC ACGGGCTTAC ATCCTCATTA CTATTCTGCC 180 

TAGCAAACTC AAACTACGAA CGCACTCACA GTCGCATCAT AATCCTCTCT CAAGGACTTC 240 

AAACTCTACT CCCACTAATA GCTTTTTGAT GACTTCTAGC AAGCCTCGCT AACCTCGCCT 300 

TACCCCCCAC TATTAACCTA CTGGGAGAAC TCTCTGTGCT AGTAACCACG TTCTCCTGAT 360 

CAAAXATCAC TCTCCTACTT ACAGGACTCA ACATACTAGT CACAGCCCTA TACTCCCTCT 420 

ACATATTTAC CACAACACAA TGGGGCTCAC TCACCCACCA CATTAACAAC ATAAAACCCT 480 

CAITCACACa AGAAAACACC C 501 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
GTTCTTAGTC TATCTCTTGT ACAAACGATG TGCTTTGAAG AT6TTAGT6T ATAACAATTG 60 
ATGTTTOTTT TCTGTTTGAT TTTAAACAGA GAAAAAATAA AAGGGGGTAA TAGCTCCTTT 120 

riTurnjiix LTrrrrrriT ttcatttcaa aattgctgcc agtgttttca atgtaggaca iso 

ACAGAGGGAT ATGCTGTAGA GTGTTTTTAT TGCCTAGTTG ACAAAGCTGC TTTTGAATGC 240 

TGGTGGTTCT ATTCCTTTGC ACATCACGAC ATTTTATAAT CATAGTTAAA TCGTATATGA 300 

CAAAAATGCT CTGATCTGAT GCCAAAGGTC AATTCAGTGT ATATAACCTG AACACACTCA 360 

TCCATTGCGT TT 372 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

Met Phe Val Phe Cys I*eu He Leu Asn Arg Glu Lys He Lys Gly Gly 
15 10 15 

Asn Ser Ser Phe Phe Leu Leu Ser Phe Phe Phe Ser Phe Gin Asn Cys 
20 25 30 

Cys Gin Cys Phe Gin Cys Arg Thr Thr Glu Gly Tyr Ala Val Glu Cys 
35 40 45 

Phe Tyr Cys Leu Val Asp Lys Ala Ala Phe Glu Cys Trp Trp Phe Tyr 
50 55 60 

Ser Phe Asp Thr 
65 
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<2) INFORMATION FOR SEQ ID 110:37: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
<D) TOPOLOGY: both 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

ACTGTCTCCC CCTTTGATAG GGACACTAAA GTGGTCTGTA CTTGGGTAGA GGATGGCANG 60 

TTAAGAATTA AAATCGTCTG GGTGCGGTCT GCACGCTTGT AATCCCAGCA CTTTGGGAGG 120 

CTGAGGCGGG CGGATCACCT GAGGTCAGGA GTTCGACACC AGCCTGATGA ACATGGAGAA 180 

ACCCCATCTC TACTAAAAAT ACAAATATTA GCTGGGCGTT GTCGCGCGCC TGTAATCCCA 240 

GCGGCTCACG AGGCTGAGGC AGGAGAATTG CTTGAGCTCG GGATGGCGGA GGTTGCAGTG 300 

AGCCAGGATT GTGCCATTGC ACTCCAGCCT GGGCAACAAG AGTGAAACTC TGTCTCAAAA 360 

AAAAAAAAAA AAAAAAA 377 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GCGTAAACAC ATTTTTGTTC TTAGTCTATC TCTTGTACAA ACGATGTGCT TTGAAGATGT 60 

TAGTGTATAA CAATTGATGT TTGTTTTCTG TTTGATTTTA AACAGAGAAA AAATAAAAGG 120 

GGGTAATAGC TCCTTTTTTC TTCTTTCTTT GATTTTAAAC AGAGAAAAAA TAAAAGGGGG 180 
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TAATAGCTCC TTTTTTCTTC TTTCri " A 1 1 T TTTTTTCATT TCAAAATTGC TGCCAGTGTT 240 

TTCAATGATG GACAACAGAG GGATATGCTG TAGAGTGTTT TATTGCCTAG TTGACAAAGC 300 

TGCTTTCaAAT GCTGGTGOTT CTATTCCTTT GACACTACGC ACTTTTATAA TACATGTTAA 360 

TGCTATAGGA CAAGATGCTC TGATTCCTGA GTGCCAGAGG TTCAATTCAG TGTATATAAC 420 

TGAACACACT CATCCATTTG TGCTTTTGTT TTTTTTATGG TGGCTTAAAG GTAAAGAGCC 480 

CATCCTTTGC AAGTCATCCA TGTTGTTACT TAGGCATTTT ATCTTGGCTC AAATTGTTGG 540 

AAGAATGGTG GCTTGTTTCA TGGTTTTTGT ATTTGTGTCT AATGCACGTT TTAACATGAT €00 

AGACGCAATG CATTGTGTAG CTAGTTTTCT GGAAAAGTCA ACTCTTTTAG GAATTGTTTT 660 

TCAGATCTTC AATAAATTTT TTCTTTAAAT TTCAAAGAAC AATGTGCTTG TGTTGATGCC 720 

• TTACAAAAAC CATTGTATAT TTGTGTATTC CTTCTTGTAT TTAGACAGTG GTTTTTCAGG 780 

TTGCGTGCTTT GTTTTCTGGT ATGG CCTTTA TGGAATGAGA CGCTTTAGCT TTGGTACGTA 840 

GCGCTAATCC ATAGCAGCTT TGGCAGTTTC GTGTCTTGAG TCTTAGCTAA AAAGTTAGAA 900 

GTTTACATGA CTGTTTTTTT TATTTTCCCT AAATTATTAC TTACTCTGAG CATTAATTAA 960 

GGGCATTTTC ACCTGTGTAA AATTATGGTC AGCTTTTTTC TGTCTATAAT TGTTTACTTT 1020 

TGTGGGTTTA CTCTAGAAAC ATGAGCCAAA AATGTCAATA GACAACACAG TATTAAAATA 1080 

ACCCAAAAGT TGTAAAGGGC AACGTTTCTC CCCTTTGATA GGGACACTAA AGTGGTCTGT 1140 

ACTTGGGTAG AGGATGGCAG ACGTTAAGAA TTAAAATGCG TCTGGGTGCG GTCTCAOGCT 1200 

TGTAATCCCA GCACTTTGGG AGG CTGAGGC GGGCGOATCA CCTGAGGTCA GGAGTTCGAC 1260 

ACCAGCCTGA TGAACATGGA GAAACCCCAT CTCTACTAAA AATACAAATA TTAGCTGGGC 1320 

GTTGTCGCGC GCCTGTAATC CCAGCGGCTC ACGAGGCTGA GGCAGGAGAA TTGCTTGAGC 1380 

TCGGGATGGC GGAGGTTGCA GTGAGCCAGG ATTGTGCCAT TGCACTCCAG CCTGGGCAAC 1440 

AAGAGTGAAA CTCTGTCTCA AAAAAAAAAA AAAAAAAAAA 1480 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i*) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

TTGGGTGAGG TGGAATGATG GTC7GTTGGA ATGGGTTTGG TAAATGGGTT TATTTCATAT 60 

CCGCTATCTT TAACTTTGGA CCGCGTTATC TATATCATGG CGTTCCTTTC TACTTTTTAA 120 

TATTGGTTCG TATTATATCG TTCCTGATTG GGGATATGGA AGACGTATTA CTTAATTGTA 180 

CTTTATTGAA ACGTTCCTCT CGGTTTCGAT TCTGGGGGCT TTGGTCTGCT CGATGGATTC 240 

TTGTCGATTT TCTCGTGTGG CAGTAACATA CCGTTTTATC ACCCTTCTAA ATATCCCATC 300 

TCCCGCTGTT TGGTAGGCTC GGAACACTAT CGACCAACAG GTTCTATCTA GAATCAAGTT 360 

GGAAATTAAA CGGTGTCTTG G 381 



(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Met Met Val Cys Trp Ann Arg Phe Gly Lys Trp Val Tyr Phe He Ser 
15 10 15 

Ala lie Phe Asn Phe Gly Pro Arg Tyr Tyr His Gly Val Pro Phe 
20 25 30 
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Tyr Phe Leu He Leu Val Arg He He Ser Phe Leu He Gly Asp Met 
35 40 45 

Glu Asp Val Leu Leu Asn Cys Thr Leu Leu Lys Arg Ser Ser Arg Phe 
50 55 €0 

Arg Phe Trp Gly Ala Leu Val Cys Ser Met Asp Ser Cys Arg Phe Ser 
65 70 75 60 



Arg Val Ala Val Thr Tyr Arg Phe He Thr Leu Leu Asn He Pro Ser 
85 90 95- 

Pro Ala Val Trp Met Ala Arg Asn Thr lie Asp Gin Gin Val Leu Ser 
100 105 110 

Arg He Lys Leu Glu He Lys Arg Cys Leu 
115 120 



(2) INFORMATION FOR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
<D) TOPOLOGY: both 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

CCCACAGGTC CTAAACTACC AAACCTGCAT TAAAAAATTT CGGTTGGTCG ACCTCGGAGC 60 

AGAACCCAAC CTCCGAGCAG TACATGCTAA GACTTCACCA GTCAAAGCGA ACGTACTATA 120 

CTCAATTGAT CCAATAACTT GACCAACGGA ACAAGTTACC CTAGGGATAA CAGCGCAATC 180 

CGATGGTGCA GCCGCTATTA AAGGTTCGTT TGTTCAAACG ATTAAAGTCC TCGTGTCTGA 240 

GTTCAGACCG AAGTAATCCA GGTCGGTTTC TATCTTCTTC AAATTCCTCC CTGTACCGAA 300 

AGGACTAATG AGAAATAAGG CCTACTTCAC AAAGCGGCCT TCCCCCGTAA TGATATCATC 360 
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TCAACTTAGT ATTATACCCA CACCCACCCA AGAACAGGTT TGTTAAAAAA AAAAAAAAAA 420 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

AACCCACTCC ACCTTACTAC CAGACAACCT TAGCCAAACC ATTTACCCAA ATAAAGTATA 60 

GGCGATAGAA ATTGAAACCT GGCGCAATAG ATATAGTACC GCAAGGAAAG ATGAAAAATT 120 

ATAACCAAGC ATAATATAGC AAGGACTAAC CCCTATACCT TCTGCATAAT GAATTAACAT 180 

GAAATAACTT TGCAAGGAGA GCCAAAGCTA AGACCCCCGA AACCAGACGA GCTACCTAAG 240 

AACAGCTAAA AGAGCACACC GTCATTGTAT GGCAAAATAG TGGGAAGATT TATAGGGTAG 300 

AGGGCGACAA ACCATCCGAG CCTTGTGATA GCTGGTTGTC CAAGATAGAT CTTAGTTCAA 360 

CCTTTAATTT GCCACAGAAC C 381 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TTTTTTTTTT TTTTTAACAA ACCCTGTTCT TGGGTGGGTG TGGGTATAAT ACTAAGTTGA €0 
GATOATATCA TTACGGGGGA AGGCCGCTTT GTGAAGTAGG CCTTATTTCT CATTAGTCCT 120 
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TTCGGTACAG GGAGGAATTT GAAGAAGATA GAAACCGACC TGGATTACTT CGGTCTGAAC 180 

TCAGACACGA GGACTTTAAT CGTTTGAACA AACGAACCTT TAATAGCGGC TGCACCATCG 240 

GGATGTCCTG ATCCAACATC GAGGTCGTAA ACCCTATTGT TGATATGGAC TCTAGAATAG 300 

GATTGCGCTG TTATCCCTAG GGTAACTTGT TCCGTTGGTC AAGTTATTGG ATCAATTGAG 360 

TTTAGTAGTC CGCTTGGAGT GGTGAAGTCT AGAATGTCCT GTTCGGGGGT TGGTTTCTGC 420 

TCCCAGGTCG CCCCAACCGA ATTTTTTATT GAAGGTTGGG TAGTTTAGCA CCTGTGGGTT 480 

GGTAAGGTAC TCTTGGAATT AATAAATTAA AGCTCCATAG GGTCTCCTCG TCTTGTTGTG 540 

TAATGCCCCC CTCTCCACGG GAAGGTCAAT TCCACTGGTT AAAAGTAAGA GAAAGCTGAA 600 

CCCTCGGGGA GCCATCCATA CAGGTCCCC 629 

U) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 
<D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GCGGGTAAAT TGGTTTGTTA TTTTTTAAAA AAAACTTGCA TGTTTAAAAA AAAGTTGATT 60 

GCTTCAAATT TCTGCTACTA ACTTCAAGCT ATGGGAGTTT GGCAGTAGTC ACTTGAGGAT 120 

TTTTTTTCCA ATTCTTTTCT TTTTGTTGTT AAAGCTGTAC TTCAGTGAAC AGAAAAATTG 180 

CCAAGCAAAC TAATGGACTA TAAAGCGTAA TTTGACTGTG TGGGACTAAA CTACAGAGCC 240 



TACTTGACCA GTGGAT 



256 



(2) INFORMATION FOR SEQ ID NO: 45: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

CATGTTTAAA AAAAAGTTGA TTGCTTCAAA TTACTGCTAC TAACTTCAAG CTATGGGAGT 60 

TTGGCAGTAG TCACTTGAGG ATTTTTTTTC CAATTCGTTT TCATTTTTGT TGTTAAAGCT 120 

CGTACTTCAG TGAGACAGAA AAATTGCCAA GCTAAACTAA TGGTCTATAA AAGCGTAATT 180 

TGCATGTGTG GGCAAAAACT ACAGAGCCTC AATTGCCACT GAGGTATAGT ACAAAGTTTT 240 

AATACATTTT GTAAATCAAA TTGAAAGAAA 270 

(2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

CATGTTTAAA AAAAAGTTGA TTGCTTCAAA TTACTGCTAC TAACTTCAAG CTATGGGAGT 60 

TTGGCAGTAG TCACTTGAGG ATTTTTTTTC CAATTCGTTT TCATTTTTGT TGTTAAAGCT 120 

CGTACTTCAG TGAGACAGAA AAATTGCCAA GCTAAACTAA TGGTCTATAA AAGCGTAATT 180 

TGCATGTGTG GGCAAAAACT ACAGAGCCTC AATTGCCACT GAGGTATAGT ACAAAGTTTT 240 

AATACATTTT GTAAATCAAA TTGAAAGAAA 270 



(2) INFORMATION FOR SEQ ID NO: 47: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:47: 

TCTGCCCAGG CTGGTCTGAA ATTCCTGGGC TGAAGTGATC CTCCAGTCTT GGCCTCCCAA • 60 

AGTGCTGGGA TTACAGGCAT GAGCTACTGA GCCTAGCCTT AATGATTAAT TTTAGAGTGA 120 

TGGCTTGTAC CTTCAaGACA CATATAGA7T GAGACAGAAA ATTTCCATCG TCCCCGAGAA 180 

AACT 184 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Ser Ser Ser lieu Gly Leu Pro Lys Cys Trp Asp Tyr Arg His Glu Leu 
15 10 15 

Leu Ser Leu Ala Leu Met He Asn Phe Arg Val Met Ala Cys Thr Phe 
20 25 30 

Lys Gin His He Glu Leu Arg Gin Lys He Ser He Val Pro Arg Lys 
35 40 45 

Leu Cys Cys Met Gly Pro Val Cys Pro Val Lys He Ala Leu Leu Thr 
50 55 60 



He Asn Gly His Cys Thr Trp Leu Pro Ala Ser 
65 70 75 
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(2) INFORMATION FOR SEQ ID NO:49: 

fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1381 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

■ rmTrmi 1 gagatggagt tttcgctctt gttgcccagg ctggagtgca atggcgcaat go 

CTCAGCTCAC CGCAACCTCC GCCTCCCGGG TTCAAGCGAT TCTCCTGCCT CAGCCTCCCC 120 

AGTAGCTGGG ATTACAGGCA TGTGCACCAC GCTCGGCTAA TTTTGTATTT TTTTTTAGTA 180 

GAGATGGAGT TTAACTCCAT GTTGGTCAGG CTGGTCTCGA ACTCCCGACC TCAGATGATC 240 

TCCCGTCTCG GCCTGCCCAA AGTGCTGAGA TTACAGGCAT GAGCCACCAT GCCCGGCCTC 300 

TGCCTGGCTA ATTTTTGTGG TAGAAACAGG GTTTCACTGA TGTTGCCCAA GCTGGTCTCC 360 

TGAGCTCAAG CAGTCCACCT GCCTCAGCCT CCCAAAGTGC TGGGATTACA GGCGTCAGCC 420 

GTGCCTGGCC TTTTTATTTT ATTTTTTTTA AGACACAGGT GTACCACTCT TACCCAGGAT 480 

GAAGTGCAGT GGTGTGATCA CAGCTCACTG CAGCCTTCAA CTCCTGAGAT CAAGCAATCC 540 

TCCTGCCTCA GCCTCCCAAG TAGCTGGGAC CAAAGACATG CACCACTACA CCTGGTAATT 600 

TTTATTTTTA TTTTTAATTT TTTGAGACAG AGTCTCACTC TGTCACCCAG GCTGGAGTGC 660 

AGTGGCGCAA TCTTGGCTCA CTGCAACCTC TGCCTCCCGG GTTCAAGTTA TTCTCCTGCC 720 

CCAGCCTCCT GAGTAGCTGG GACTACAGGC GCCCACCACG CCTAGCTAAT TTTTTTGTAT 760 

TTTTAGTAGA GATGGGGTTT CACCATGTTC GCCAGGTTGA TCTTGATCTC TTGACCTTGT 640 

GATCTGCCTG CCTCGGCCTA CCCAAAGTGC TGGGATTACA GGTCGTGACT CCACGCCGGC 900 

CTATTTTTAA riTm»TTTG TTTGAAATGG AATCTCACTC TGTTACCCAG GTCGGAGTGC 960 
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AATGGCAAAT CTCGGCTACT CGCAACCTCT GCCTCCCGGG TCAAGCGATT CTCCTGTCTC 1020 

AGCCTCCCAA GCAGCTGGGA TTACGGGACC TGCACCACAC CCCGCTAATT TTTGTATTTT 1080 

CATTAGAGGC GGGTTTACCA TATTTGTCAG GCTGGGTCTC AAACTCCTGA CCTCAGGTGA 1140 

CCCACCTGCC TCAGCCTTCC AAAGTGCTGG GATTACAGGC GTGAGCCACC TCACCCAGCC 1200 

GGCTAATTTG GAATAAAAAA TATGTAGCAA TGGGGGTCTG CTATGTTGCC CAGGCTGGTC 1260 

TCAAACTTCT GGCTTCAGTC AATCCTTCCA AATGAGCCAC AACACCCAGC CAGTCACATT 1320 

TTTTAAACAG TTACATCTTT ATTTTAGTAT ACTAGAAAGT AATACAATAA ACATGTCAAA 1380 

C 1381 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

CCATTGTTAG GTTGTCTCTT ACCTGTTAAA ATCAGGAGCT GACAAGAAAT GCTTACCACA 60 

AAAGGAGAAA TGCCAGTCTA GTTAACAGTC AAGGAGAGAA ATCAGGAAGA TTATGTGGGT 120 

GGAAGAAGTA GATGATGTGG CTGATGAGTG AGTGAGTGAG CAAGCCTCCG CCCAGCTGAA 160 

GAAGGAGTCA GAACTGCCCT TTGTTCCCAA CTATTTGGCG AACCCCAGCC TTCCCTTTTA 240 

TCTATACACC CACAGCAGAG GATTCAGCCC AGATGCAGAA TGGGGGCCCC TCCACACCCC 300 

CTGCATCACC CCCTGCAGAT GGCTCACCTC CATTGCTTCC CCCTGGGAAC CTCCCCTGTT 360 

AGGGACCTTT CCCCGGGACC ACACCTCTTT GGCACTAGTT CAGAATGGTG ATGTGTCGGC 420 

CCCTCTGCCA TACTAGAACA CCAGAAAGAC AAACGGGTGA TGTTTGTCAG CTACAGTGAG 480 
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TCTAGAGCCG TCCTGTTTTC TTCTGTCCCG TCCCAAGCCA CCATGTCTCT TCOAGCCTCA 540 

AAATGGGACG TATGCAGGAC CAGCGCCCAG ATTCCAAGCC ATTTTTCTTC ACTGGAGCAT 600 

TTCCAXTTAA T ATG CAAG AG CTGGTACTCA AGGTGAGAAT TCAGAACCCA TCTCTTCGAG 660 

• AAAATGATTT CATTGAAATT GAACTGGACC GACAGAGCTC ACCTACCAAG AGTTGCTCAG 720 

AGTGTGTTGC TGTGAGCTGG GTGTTAATCC AGATCAAGTG GAGAAGATCA GAAAGTTACC 780 

CAATACTCTG TTAAGGAAGG ACAAGGATGT TGCTCGACTC AAGATTTCAG GAGCTGGAAC 840 

TGGTTCTGAT GATAGTGAAA ATAATTTTCT GTTCAGAAAT GCTGCATCAC ACTGACTGAA 900 

AGGCCTTGCT ATACAGGAGA GCTTCAAAAC TGACTTACTA ATGCAGCAGG GACTTTTATA 960 

CTGAGTATAT GACAGTGTGC ATCACCTCTG GGCCAAGGAC AAGCCATGAT CTAAATGCCT 1020 

CAGATGCCCG GGCCAGTCTG GTGCACTGCA TAGTATATAC GAACATCATT CTGCCCAAGG 1080 

TAGGAAGCCC CATGACCCCC AAGCAGTGGT GTCCACTCTT CCAAGCCTCT TGGTGCACAA 1140 

TAAACCTTAT TGCTTGAAGC TTTGAACGAC TGTGAGAATG GTCTGGCGAG GACGAGAACG 1200 

TGGAATTATA TGAGTGTCTT TTGTATCCGA GAATGTAGAG AGTTCTCTGA AGACGACGAC 1260 

TGAGAGAGAG CGGACGCTAT TTCTAGCCAC TCCTGTTGAC AGTGCACCTG AAGGGCTGGG 1320 

ATGCGTTTTT CTTGGTGTTG CATGCTCACA ACTCTGCTGA CATTGGGAAC TTATGAGAGA 1380 

GGAAGACTCG GGAAAGCACA GATACTGGAC AGATGGATTC TGGTGTGGGG AAAGCACAGA 1440 

TACTGGACAG ATGGTTCTAG TGTGACTTGT GACTGTGAGG TTTCCTATAA CATATTTATA 1500 

AATGTTCATC AGGTTCAAAA GTCTATAAGA ATACAGTTCG AGACTGAATT GCTTCGAAAT 1560 

ACTTCGATGT TGGOAACCAA AAOAOCTTTC CCTCCCTCAC TTTTTCCTTT GTAACACTCA 1620 

TGACTGCTTC TCTGTCTCGA GTCATCTCTG CATTAACTCC CCTTCGTGGT CACTAGAGGG 1680 

CTCTCTGATG CTTCTAAGAC ACTGCTTTTT ACATGCCACA CCCACCGCGT AGAGACAGGG 1740 

TCTCACTATG TGGCCCAGGC TGGTCT C AAA CTTCTGGCCT TAAGTGATCG TCCTGTCCTT 1800 



WO 96/15272 PCTAJS95/171 1 1 

-140- 

CGCGCTCGGA AGAAAGTCGT GGGGATTACA GGTGTGAGCC ACCCGCCCAG CCCCTCCCTT I860 

GTGTTTCAAC CAATCGGAAG TGAATTTAAC TAGATGTAGT AACCTTTTTT TTCTTTGACT 1920 

TCTAAAJiAAG TTACAGTTTA CTAATAAAGT TAAGTCTGGT TCTGTCCTAG AGGAAATAAA 1980 

TTCACTATTA ATTCATGTCT TAAGTTACTT GGGTTAAAAC ACTTTCAGCC ACCCAGATTA 2040 

ATTAAAGTGG AGCAGTGGAG CCCCTGGCTG GGGAGATGGG CCTCCAGAGG AGCAGCTGCA 2100 

GGCATGTTCT GGCTACACAG AGGCAAGCAA GGGACTGGTG TCTCTGGTGA GAGGTGGGTT 2160 

TGATGTATCT CTGTCCTATG CTGGTCTCTC TTCTCCTTTA TAAATCCTCC TGTGGTCACT 2220 

GACTATCGTA TCGCAGTGAT CAGACTGCAC ATAGTACGGT TAGGCTGAGC TTAATGTCTT 2 2 BO 

AATCATGTCA TTCGAGAGAA GACACGTTTT GATTCATGCT TTGTGTAATT AATCAATCAA 234 0 

GGATTCTTTT TTTAGCTTTG TTGACGTGTA ATTCACCCCT CCTCCTCCAC TG CAT ATTT A 2400 

AAGCATGTGT TCACACTGTG TGTATACATT CACTGCGATT TTTTCGTTTG CTGCATTGCT .2460 

TGGACTGTTC ATAACATCAC AAGTATTATT CAAATAAAAT ATTAACTGAC CGAAAAAAAA 2520 

(2) INFORMATION FOR SEQ ID NO:51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOIiOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

GAATTCCTGG GCTCAAGTGA TCCTCTCATG TCAGTCTCCC AAAGTGCTGG GATGACAGGC 60 

TTGAGCCACC ACACCAGGCC CATCATCAGT TTTTATATAA AGAAAAAAAA ACCTTAAAAT 120 

TGTTAGCAAA ATACTATOAC A 141 
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(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEEENESS: both 

(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

GAACTCCTGA CCTCAGGTGA TCCGCCCGCC TCAGCCTCCC AAAGTGCTGG GATTACAAGC 60 

GTGCAGACCG CACCCAGACG ATTTTAaTTC TTAACNTGCC ATCCTCTACC CAAGTACAGA 120 

CCACTTTAGT GTCCCTATCA AAGGGGAGAC A 151 



(2) INFORMATION FOR SEQ ID NO: 53: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
AATTCTCCTG CCTCAGCCTC GTGAGCCGCT GGGATTACAG GCG 43 



(2) INFORMATION FOR SEQ ID NO:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 



W 96/15272 



PCT/US95/17111 



-142- 



AAGCCAACTC AGACTCAGCC AACAGGTAAG TGGGCATTAC AGQAG 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CTCAGTTTTC ACTTTGATCT GGTAGATAGT TTTCGTTTCA GTTGGGGGAG AAGGATCTGT 
TTGTAAGAAC GGAGTGACGG GAT AC CATAA AAATAGAGGT AATAACATAC ATTGG GACGT 
GTAAATTTAT TTTTATGGAA GTG 

(2) INFORMATION FOR SEQ ID NO: 56: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CTCACAGTTT CCTTATCTTG GTGGTCGTAA GTTTTCGTCG AAACAGTTGA TCGTTATTTG 
TGAGATTGTC GTATAGGGAG ACTAACAGGT AGTAACTTTT GTGACCGTCG TTAAAACTTT 
ACTTTTTTTT TTCTTTCTTC TTTTTTCCTT CATAATG 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 base pairs 



60 
120 
143 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
AOCACGCCCC GCTAATTTTT GTATTTTTAG TAGAGACAGG GTTTCACCGT GTTGGCCAGG 60 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 60 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
ACAACGCCCA GCTAATATTT GT ATT TTT A G TAGAGATGGG GTTTCTCCAT GTTCATCAGG 60 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 
CTGGTCTGAA ATTCCTGGGC TGAAGTGATC CTCCAGTCTT GGCCTCCCAA AGTGCTGGGA 60 



(2) INFORMATION FOR SEQ ID NO; 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CTGGTGTCGA ACTCCTGACC TCAGGTGATC CGCCCGCCTC AGCCTCCCAA AGTGCTGGGA 60 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 61: 
AGACACATAT AGATTGAGAC AGAAAA 26 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
AGTACAGACC ACTTTAGTGT CCCTATCAAA 



(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
AGATCTCGCT CTGTCACCCA GGCTGAAOTQ C 31 

(2) INFORMATION FOR SEQ ID NO:64:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 
AGAGTTTCAC TCTTGCTTGC CCAGGCTGGA GTGC 34 

(2) INFORMATION FOR SEQ ID NO: 65: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
AGTGGCCCAA TCTCGGCTCA CTGCGAGCTC CACCTCCCGG GTTCACTTCA TTCTCCTGC 59 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 
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AATGGCACAA TCCTGGCTCA CTGCAACCTC CGCCCTCCCG AGCTCAAGAA CTTCTCCTGC 60 



(2) INFORMATION FOR SEQ ID NO: 67; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CTCAGCCTCT GAGTAGCTGG GACTACAGGC GCCCACCACA AGCCGCTAAT TTTTGTATTT 60 
TTGTAG 66 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH.- 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6B: 
CTCAGCCTCG TGAGCCGCTG GGATTACAGG CGCGCGCCAC AAGCGACTAA TATTTGTATT 60 
TTTGTAG 67 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



ACCACGCCCC GCTAATTTTT GTATTTTTAG TAGAGACAGG GTTTCACCGT GTTGGCCAGG 



60 



ATGCTCGATC TCCTGA 



76 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
ACAACGCCCA GCTAATATTT GTATTTTTAG TAGAGATGGG GTTTCTCCAT GTTCATCAGG 60 
CTGGTGTCGA ACTCCTGA 78 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

CCCCAAGCAG TGTTAATCCT GGCTACTGTG TGAGCTGACC TCAAGCACAG GTGAAGGCAG 60 

AGAATCCATC CACCTGTTTC TGTTCTCCCT GCTTAGCTCC AGGGATGGAA CTGGGACTGG 120 

GATAGAGGAA- AGGTQAACTC CTCATTAAGG AAATGGATG 159 



(2) INFORMATION FOR SEQ ID NO: 72: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

CCCCTGTTCT TGGGTGGGTT TGGGTATATT CTGGTTGAGA TGATATCATT TACGGGGGAA 60 

GGCGCTTTGT GAAGTAGGCC TTATTTCTCT TGTCCTTTCG TACAGGGAGG ATTTGAAGTA 120 

GTAGAACGCT GTTACTCCGG TCTGAACTCA GTCACGTGGC TTTATCGTTG 170 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 
GATCCAAGCT AOGTACGCGT GCATGCACGT CATAGCTCTT CTATAGTGTC AC 52 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
GATCCGAGCT CGGTACCAAG TTGATGCATA GCTTGAGTAT TCTATAGTGT CAC 53 



WO 96/15272 



PCT/US95/17111 



-149- 



(2) INFORMATION FOR SEQ ID NO -.75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GTATGGGCCC GATAGCTTAT TTAGCCTTTA GAGCACACTG GCGGCCGTTA CTAGTGGATC 
CGAGCTCGGT ACCAACTTGA TGCATAGCTT GAGTATTCTA TAGTGTCACC TAAAT 



(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

ixl) SEQUENCE DESCRIPTION: SEQ ID N0:76: 
ATATAGACAA TATAACAATA TATTGTATAC TTTAGAGCAC ACTGGCAGCC GTTACTAGTG 
GATCCGAGCT CGGTACCAAG TTGATGCATA GCTTGAGTAT TCTATAGTGT CACTAATAGT 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CTTAATAGAT AGCTACTTAA AATAACTTAC ACACTGTTTT AGAGTGCTTG AAAACTATCT 



60 



W 96/15272 



PCTAJS95/17I11 



-150- 

GATCAGACAT AGTATTGAAA CCAATGAATA CATTATATAA AGTAAAGGAA AGGAGAA 117 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

CTTACTACCA GACAACCTTA GCCAAACCAT TTACCCAAAT AAAGTATAGG CGATAGAAAT SO 

TGAAACCTGG CGCAATAGAT ATAGTACCGC AAGGAAAGAT GAAAAATTAT AACCAAGCAT 120 

AATATAGCAA GGACTAA * 137 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

CTATAAAGGT CGTTGTCAAC GATAAAGCAC GTGACTGAGT TCAGACCGGA GTAACAGCGT 60 

TCTACTACTT CAAATCCTCC CTGCGAAAGG CAAGAGAAAT AAGGCCTACT TAAGCGCCTT 120 

CCCCCGTAAA TGATATCATC TCAACCAGAA TATACCCAAA CCCCCCAAGA ACAGGGGAGG 180 

AAAAGAAAAA AAAAAAAA 156 

(2) INFORMATION FOR SEQ ID NO:B0: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 200 base pairs 
(BJ TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

CTAXAGAGAT TGTTGATTTG CCTCTTAAGC AAGAGATTCA TTGCAGCTCA GCATGGCTCA €0 

GACCAGCTCA TACTTCATGC TGATCTCCTG CCTGATGTTT CTGTCTCAGA GCCAAGGTAA .120 

GATCTCTTTT CCAACTCTTT CTAGCCCTGA AGACTTCACT CTATCCCCAA GCATACGGGT 180 

CTACTTGAAA AAAAAAAAAA 200 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: both 
(D) TOPOLOGY : both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

CAGAAAACTA GCTACACAAT GCATTGGTCT ATCATGTTAA AACGTGCATT AGACACAAAT 60 

ACAAAAACCA TGAAACAAGC CA 82 

(2) INFORMATION FOR SEQ ID NO: 82; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D> TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

CAGAAAAATT GCCAAGCTAA ACTAATGGTC TATAAAAGCG TAATTTGCAT GTGTGGGCAT €0 
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AAACXACAGA GCTCATGCTA GAGTATGCAA 90 

(2) INFORMATION FOR SEQ ID NO:83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

AATTGGGTAC CGGGCCCCCC CTAGAGGTCG ACGGTATCGA TAAGCTTGTA TCGAATTCCG 60 

GACTTTGCTT TTGGTTTTCC TTTCCTGTGA AAAGGTTGGT TTTAAAGTGA GATACACTTT 120 

TCCGTAGAAC AAGTGTTCTA T , 141 

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

AAGTTGATTG CTTCAAATTT CTGCTACTAA CTTCAAGCTA TGGGAGTTTG GCAGTAGTCA 60 

CTTGAGGATT TTTTTTCCAA TTCGTTTTCA TTTTTGTTGT TAAAGCTCGT ACTTCAGTGA 120 

GACAGAAAAA TTGCCAAGCT AAACTAATGG TCTAT 155 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 232 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

GAATTCCTGG GCTCAAGTOA TCCTCTCATG TCAGTCTCCC AAAGTGCTGG GATGACAGGC 60 

TTGAGCCACC ACACCAGCCC ATCATCAGTT TTTATATAAA GAAAAAAAAA CCTTAAAATT 120 

GTTAGGCAAA TAATGACAAA TTGTAATATA TATTCTTACA TTTCAGATTT TTATTTTTTA 180 

AACTGATAAG AATTGATTAA TAAATAAAAT TTAGTATTAA TCTGTCTTTT AA 232 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

AAATTCCTGG GCTGAAGTGA TCCTCCAGTC TTGGCCTCCC AAAGTGCTGG GATTACAGGC 60 

ATGAGCTACT GAGCCTAGCC TTAATGATTA ATTTTAGAGT GATGGCTTGT ACCTTCAAGC 120 

AACATATAGA GTTGAGACAG AAAATTTCCA TCGTCCCGAG AAAACTGTGC TGCATGGGCC 180 

CCGTGTCCCC TGTGAAOATC GCCCTATTAA CTATAAATGG GCATTGCACA TGGTTGCCAG 240 



CTTCA 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



245 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NOt87: 

AACGTTTCGA ACCTATCGTG AAGCCCGATT TTAGAGTTAA TACGGGTGCT TCAAGGGAAC 60 

4 

GGGGCTATGA GAAGTTTTCT ACGGGGAGCA TGGAAATTTT CTGTCTCAAT ATGTGCTTGA 120 

AGGTACAACC GTATCTAAAA TTAATCATTA AGGCTAGGCT CAGTAGCTCT GCCTGTAATC 180 

CCAGCACTTT CGGGAGGCCA AGACTGGAGG ATCACTTCAG CCCAGGAATT TCAGACGCC 239 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

TACTGTGTGA GCCTGACCTC AAGCACAGGA TTCCAGAAAT GGAAGGATGT GCCTTGTGAA 60 

GACAAGTTCT CCTTTGTCTG CAAGTTCAAA AACTAGAGGC AGCTGGAAAA TACATGTCTA 120 

GAACTGATCC AGCAATTACA ACGGAGTCAA AAATTAAACC GGACCATCTC TCCAACTCAA 180 

CTCAACCTGG ACACTCTCTT CTCTGCTGAG TTTGCCTTGT TAATCTTCAA TAGTTTTACC 240 

TACCCCAGTC TTTGGAACCT 260 



(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
CTTGCTTGCC CAGGCTGGAG TGCAATGGCA CAATCCTGGC TCACTGCAAC CTCCCCCTCC 60 
CGAGCTCAAG AACTTCTCCT GCCTCAGCCT CGTGAGCCGC TGGGATTACA GGCGCGCGCC 120 
ACAAGCGACT AATATTTGTA TTTTTGTAG 149 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
CTCGCTCTGT CACCCAGGCT GAAGTGCAGT GGCCCAATCT CGG CTCACTG CGAGCTCCAC 60 
CTCCCGGGTT CACTTCATTC TCCTGCCTCA CTGCCTCAGC CTCTGAGTAG CTGGGACTAC 120 
AGGCGCCCAC CACCACGTCC CCTGCTAATT TTTTGTATTT TTAGTAG 167 
(2) INFORMATION FOR SEQ ID N0:91: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS: both 
<D) TOPOLOGY : both 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 
CCATGTTCAT CAGGCTGGTG TCGAACTCCT GACCTCGTGA TCCGCCCGCC TCAGCCTCCC 60 
AAAGTGCTGG GATTACAAGC GTGC 84 



(2) INFORMATION FOR SEQ ID NO: 92: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 
CCGTGTTGGC CAGGATGGTC TCGATCTCCT GACCTCGTGA TCCGCCCGCC TTGGCCACCC 
AAAGAGTTTG GGATTACAGG CGTGC 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TGCAGCAATG GCAACAACGT CTGCAAACTA TTAACTGGCG AACTACTTAC TCTAGCTTCC 
CGGCAACAAT TAATAGACTG GATGGAGGCG GATAAAGTTG CAGGACCACT TCTGCGCTCG 
GCCCTTCCGG CTGGCTGGTT TATTGCTGAT AAATCTGGAG CCGGTCGAGC GTGGGTCTCG 
CGTATCATTC GAGCACTGGG GCCAGATGGT AAGCCCTCCG TATCGTAGTT ATCTCACAGC 
AGGGAGTCAG G 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

TGCAGGAGCG GGGAGGCACG ATGGCCGCTT TGGTCCGGAT CTTTGTGAGO AACCTTACTT 60 

CTGTGGTGTG ACATAATTGa ACAAACTACC TACAGAGATT TAAAGCTCTA AGGAAATATA 120 

AAATTTTTAA GTGTATAATG TGTTAAACTA CTGATTCTAA TTGTTTGTGT ATTTTAGATT 180 

CCAACCCTAT GGAACCTGAT GAATGGGAGC CAGTGGTGGA ATGCCTTTAA TGAGGAAACC 240 

TG 2 42 

(2) INFORMATION FOR SEQ ID NO; 95; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY; both 

(Xi) SEQUENCE DESCRIPTION: SEQ IDNO:95: 

TGCAGCAATC TTTCTTATAT ACATGCTTAA TAGATAGCTA CTI\ ; AAATAA CTTACACACG 60 

TTTTAGAGTT GCTTGAAAAC TATCT \TCA AGACATAGTA ATTGAAACCA ATGAATACAT 120 

TATATAAAGT AAAGGAAAGG AGAAGAGAGG AAAGGGAGGG GAAGAGGAGA GGGAGGGACA 180 

AGCGAGAAAG GAAAGGGAAG GGAGAAAA 208 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
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CTCACTAAAG GGATCAAGGA ATAATTTTGA ATTTCAAGTC TTACATTTAA TAAATACATT 60 

CATAAGGCTA TAACTACCAT ACGTTGTGAT TTCTCTGATT AATTTAAAAA TAAATTAAAA 120 

CCTGGAAAGA ATTTTACCAT TCTAGGAAGC CA 152 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

AATCTATCTT ATATACATGC TTAATAGATA GCTACTTAAA ATAACTTACA CACGTTTTAG 60 

AGTTGCTTGA AAACTATCTG ATCAAGACAT AGTAATTGAA ACCAATGAAT ACATTATATA 120 

AAGTAAAGGA AAGGAGAAGA GAGGAAAGGA GGGGAGAGGA GAGGAGGACA AGCGAGAAAA 160 

GGAAGGGAAG GGAGAAAAAG GGGGAAAGGG AGGTAGAGAG AGAGAGAAAA AGTGCTGGTC 240 

ATATAGTAAG TGTACATTTT AACTTTTTAA GAAACTACCC TACTCTATTC CAGAGTGATT 300 

GTACATGTGC ATTTTACTGC ATTATAGAGA TCATTTTC 338 

(2) INFORMATION FOR SEQ ID NO: 98 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 169 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
TGCAGGAGTG GGGAGGCACG ATGGCCGCTT TGGTCCGGAT CTTTGTGAAG GAACCTTACT 



60 
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TCTGTGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAACGTCTA AGGTAAATAT 120 
AAAATTTTTA GTGTATAGGT TAAACTACTG ATTCTAATGT TGTGTATTT 169 



(2) INFORMATION FOR SEQ ID NO: 99: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ IB NO: 99: 

CCCCGGGCTC CAGCAATGGC AACAACGTCT GCAAACTATT AACTGGCGAA CTCATTCATC 60 

TAGCTTCCCG GCAACAATTA ATGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT 120 

CGCGTGGCCC TTCCGGCTGG CTGGTTTATT GCTGATAATT GAGCGTGCGA GTGGCTCGCG 180 

TATCATTCGC GACATGGGCC AGTAGGTAC 209 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CTTGCCCTTC ATGGAGTCAT ACAGCCGATC AGCAAAATGC AGGGGCTTGT TCTGAATGCA 60 

CTGAACCAGG TTCAGGAAAG CATTTTCCAG GTCTCCTTTA ACCTCTTTCC TGATGCTTTC 120 

CAACATGTCA TAAGGGCTGT AACTCTTGTA CCTATCAAAT ACTTTCTGGA GGTGGGGACA 180 

CGCTCGCGTC GGTCATGATG CTGATCCACT TGGGAACATC AGTTCTTTCC TCTTCACTCC 240 
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AGCTGCATAG AGATCCGAGG ACTCTTGGTC AA 272 



(2) INFORMATION FOR SEQ ID NO:101: 
* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

ACGGCCCAGC TTCCTTCAAA ATGTCTACTG TTCACGAAAT CCTGTGCAAG CTCAGCTTGG 60 

AGGGTGATCA CTCTACACCC CCAAGTGCAT ATGGGTCTGT CAAAGCCTAT ACTAACTTTG 120 

ATGCTGAGCG GGATGCTTTG AACATTGAAA CAGCCATCAA GACCAAAGGT GTGGATGAGG 180 

TCACCATTGT CAACATTTTG ACCAACCGCA GCAATGACAC GAGACAGGAT ATTGCCTTCG 240 

CCTACCAGAG AAGGACCAAA AAAGGAACTT GCATCACA 278 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 228 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY : both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 102: 

AGCAATAGCA AAGGAAAGGA AACAATATTT AGCAAGGTTT ATTCTTCCTT TGTGTCAGCA 60 

TTTCTGAGTG TGCACACAGG CCCAGTGATT CCATGTATTT TTGAGTGACC ACTGCCTCTG 120 

TCTGGCCCTT CCCCATAGAA CCGCCGCTGG TGGAGCGTGG GTCCCTGGTC TCCTACAAGT 180 

CCTGGGGCAT TGGAGCCCCA AGCAGTGTTA ATCCTGGCAC TGTGTNAG 228 
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(2) INFORMATION FOR SEQ XD NO: 103: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

AGCAATGGCA ACAACGTCTG CAAACTATTA ACTGGCGAAC TACTTACTCT TAGCTTCCGG 60 

CAACAATTAA TAGACTGGAT GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC 120 

CTTCCGGCTG GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTCGAGCGTG GGTCTCGCGT 180 

ATCATTCGAG NCTGGGGCCA GATGGTAAGC CCTCCGTATC GTAGTTATCT CACAGCAGGG 240 

AGTCAG 246 



(2) INFORMATION FOR SEQ ID NO:104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 06 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID 110:104: 
CTGGCACTGG AGCAGTGGGT CCCTGGTCTC CTACAAGTCC TGGGGCATTG GAGCCCCAAG 60 
CAGTGTTAAT CCTGGCACTG TGTGAG 86 



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY : both 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:105: 

CTGGAGCCGG TOGAGCGTGG GTCTCGCGTA TCATTCGAGN CTGGGGCCAG ATGGTAAGCC €0 

CTCCGTATCG TAGTTATCTC ACAGCCGTAT CATTCGAGNC TGGGGCCAGA TGGTAAGCCC 120 

TCCGTATCGT AGTTATCTCA CAGCAGGGAG TCAG 154 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 221 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:106: 

AGGAGACTTG TGGTAAAAAT CTGCTGCTGT ACTGCTCATT TGGGAACCTT AGTATACTAA 60 

ATAATATAAT ATATCAACAA CTAATGGTCA GCCAATGCTA TGCTGGATAT GAGGGTCCTG 120 

GGCCACAAAG ACAAAAAATC AGGAACCACT TTTTAAGTGA GATACTTTGG GTCTCTGTCA 180 

AATTCATAAC ACTTATTTCT TGGTGGAATA CAGTTAATGA G 221 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO.-107: 

AGGAGCGGGG AGGCACGATG GCCGCTTTGG TCCGGATCTT TGTGAGGAAC CTTACTTCTG 60 

TGGTCfGACA TAATTGGACA AACTACCTAC AGAGATTTAA AGCTCTAAGG AAATATAAAA 120 

TTTTTAAGTG TATAATGTGT TAAACTACTG ATTCTAATTG TTTGTGTATT TTAGATTCCA 180 

ACCCTATGGA ACCTGATGAA TGGGAGCCA6 TGGTGGAATG CCTTTAATGA G 231 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10B; 
TAAATCTGGA GCCGGTCGAG CGTGGGTCTC GCGTATCATT CGAGCACTGG GGCCAGATGG 60 
XAAGCCCTCC GTATCGTAGT TATCTCACAG CAGGGAGTCA GG 102 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

TAGAACCGCC GCTGGCACTG GAGCAGTGGG TCCCTGGTCT CCTACAAGTC CTGGGGCATT 60 

GOAGCCCCAA GCAGTGTTAA TCCTGGCTAC TGTGTGAGCC TGACCTCAAG 110 

(2) INFORMATION FOR SEQ ID NO: 110: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 85 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY : both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
ATAGATATCT ACTTTATTCG ATTTAAATTC TGTTTAGTAT TTTATTATAT TTTGTTAATC 60 
CATTTGTCCC AATTCATATA CTTAT 85 



(2) INFORMATION FOR SEQ ID NO: 111: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 
<C) STRAND EDNESS : both 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:lll: 
ATAGATAGCT ACTTAAAATA ACTTACACAC TGTTTTAGAG TGCTTGAAAA CTATCTGATC 60 
AGACATAGTA ATTGAAACCA ATGAATACAT TATAT 95 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : both 
<D) TOPOLOGY: both 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
GAAGGATCTG TTTGTAAGAA CGGAGCGGGA TACCATAAAA ATAGAGGTAA TAACATACAT 60 
TGGGACGTGT AAATTTATTT TTATNNAANT 90 
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(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 
GAAGGATCTT ACCATTTTAA GAAAGGTCCA AAATTAAATA AAAATTTAAT TAGTCTCTTT 60 
AGTGTTGCAT ACCATCAATA TCGGAATACT AAAT 94 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

ACCTGGTGCT GTGCTCACCC AGGCCGAGGG TGCCTTTGTG GCCTCACTGA TTAAGGAGAG 60 

TGGCATGATG ACTTCAATGT CTGGATTGGC CTCCATGACC CCAAAAAGAA CCGCCGCTGG 120 

GGAGCGTGGG TCCCTGGTCT CCTACAAGTC CTGGGGCATT GGAGCCCCAA GCAGTGTTAA 180 

TCCTGGCTAC TGTGTGAGCC 200 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 base pairs 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS: both 
(D) TOPOLOGY: both 
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(xi) SEQUENCE DESCRIPTION: SEQ ID KO:115: 

AACTGGCGAA CTACITACTC TAGCTTCCGG CAACAATTAA TAGACTGGAT GGAGGCGGAT 60 

AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG GCTGGTTTAT TGCTGATAAA 120 

TCTGGAGCCG GTGAGCGTGG GTCTCGCGTA TCATTGCAGC ACTGGGGCCA GATGGTAAGC 180 

CCTCCGTATC GTGGTTATCT ACACGACGGG GAGTACGC 218 
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : both 
(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
AGGCCCATCA TCAGTTTTTA TATAAAGAAA AAAAAACCTT AAAATTGTTA GGCAAATACT 60 
ATGACAAATT GTAATATATA TTCTTACATT TCAGATTTTT ATTTTTTAAA CTGTATAGAA 120 
TTGATTAATA AATAAAATTT AGTATT 146 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 
<D) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
AGGCACGATG GCCGCTTTGG TCCGGATCTT TGTGAAGGAA CCTTACTTCT GTGTGTGACA 60 
TAA3TGGACA AACTACCTAC AGAGATTTAA ACGTCTAAGG TAAATATAAA ATTTTTAGTG 120 
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TAXAGGTTAA ACTACTGATT CTAATGTTGT GTATT 155 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

Ixl) SEQUENCE DESCRIPTION r SEQ ID NO: 116 : 
TAAATCTGGA GCCGGTGAGC GTGGGTCTCG CGTATCATTG CAGCACTGGG GCCAGATGGT 60 
AAGCCCTCCG TATCGTGGTT ATCTACACGA CGGGGAGTAC GG 102 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:119: 
TAGAACCGCC GCTGGCACTG GAGCAGTGGG TCCCTGGTCT CCTACAAGTC CTGGGGCATT 60 
GGAGCCCCAA GCAGTGTTAA TCCTGGCTAC TGTGTGAGCC TGACCTCAAG CACAGG 116 



(2) INFORMATION FOR SEQ ID NO: 120: 

(a) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 141B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 
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(lx) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 14.. 1416 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

TmTT TT r T GAG atg gag ttt TCG ctc TTG ttg ccc agg ctg GAG TGC 49 
Met Glu Phe Ser Leu Leu Leu Pro Arg Leu Glu Cys 
15 10 

AAT GGC GCA ATC TCA GCT CAC CGC AAC CTC CGC CTC CCG GGT TCA AGC 97 
Asn Gly Ala lie Ser Ala His Arg Asn Leu Arg Leu Pro Gly Ser Ser 
15 , 20 25 

GAT TCT CCT GCC TCA GCC TCC CCA GTA GGC TGG GAT TAC AGG CAT GTG 145 
Asp Ser Pro Ala Ser Ala Ser Pro Val Gly Trp Asp Tyr Arg His Val 
30 35 40 

CAC CAC GCT CGG CTA ATT TTG TAT TTT TTT TTA GTA GAG ATG GAG TTT 193 
His His Ala Arg Leu lie Leu Tyr Phe Phe Leu Val Glu Met Glu Phe 
45 50 55 60 

CTC CAT GTT GGT CAG GCT GGT CTC GAA CTC CGA CCT CAG ATG ATC CTC 241 
Leu His Val Gly Gin Ala Gly Leu Glu Leu Arg Pro Gin Met lie Leu 
65 70 75 



CCG TCT CGG CCT CCC AAA GTG CTA GAT ACA GGA CTG AGC ACC ATG CCC 269 
Pro Ser Arg Pro Pro Lys Val Leu Asp Thr Gly Leu Ser Thr Met Pro 
80 65 90 

GGC CTC TGC CTG GCT AAT TTT TGT GGT AGA AAC AGG GTT TCA CTG ATG 337 
Gly Leu Cys Leu Ala Asn Phe Cys Gly Arg Asn Arg Val Ser Leu Met 
95 100 105 

TGC CCA AGC TGG TCT CCT GAG CTC AAG CAG TCC ACC TGC CTC AGC CTC 365 
Cys Pro Ser Trp Ser Pro Glu Leu Lys Gin Ser Thr Cys Leu Ser Leu 
110 115 120 



CCA AAG TGC TGG GAT TAC AGG CGT GCA GCC GTG CCT GGC CTT TTT ATT 
Pro Lys Cys Trp Asp Tyr Arg Arg Ala Ala Val Pro Gly Leu Phe He 
125 130 135 140 



433 
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TTA TTT TTT TTA AGA CAC AGG TGT CCC ACT CTT ACC CAG GAT GAA GTG 481 
Leu Phe Phe Leu Arg His Arg Cys Pro Thr Leu Thr Gin Asp Glu Val 
145 150 155 

CAG TGG TGT GAT CAC AGC TCA CTG GAG CCT TCA ACT CTG AGA TCA AGC 529 
Gin Trp Cys Asp His Ser Ser Leu Gin Pro Ser Thr Leu Arg Ser Ser 
160 ICS 170 

ATC CTC CTG CCT CAG CCT CCC AAA GTA GCT GGG ACC AAA GAC ATG CAC 577 
He Leu Leu Pro Gin Pro Pro Lys Val Ala Gly Thr Lys Asp Met His 
175 160 185 

CAC TAC ACC TGG CTA ATT TTT ATT TTT ATT TTT AAT TTT TTG AGA CAG 625 
His Tyr Thr Trp Leu He Phe He Phe He Phe Asn Phe Leu Arg Gin 
190 195 200 

AGT CTC AAC TCT GTC ACC CAG GCT GGA GTG CAG TGG CGC AAT CTT GGC 673 
Ser Leu Asn Ser Val Thr Gin Ala Gly Val Gin Trp Arg Asn Leu Gly 
205 210 215 220 

TCA CTG CAA CCT CTG CCT CCC GGG TTC AAG TTA TTC TCC TGC CCC AGC 721 
Ser Leu Gin Pro Leu Pro Pro Gly Phe Lys Leu Phe Ser Cys Pro Ser 
225 230 235 

CTC CTG AGT AGC TGG GAC TAC AGG CGC CCA CCA CGC CTA GCT AAT TTT 769 
Leu Leu Ser Ser Trp Asp Tyr Arg Arg Pro Pro Arg Leu Ala Asn Phe 
240 245 250 

TTT GTA TTT TTA GTA GAG ATG GGG TTT CAC CAT GTT CGC CAG GTT GAT 817 
Phe Val Phe Leu Val Glu Met Gly Phe His His Val Arg Gin Val Asp 
255 260 265 

GCT AGA TCT CTT GAC CTT GTG ATC TGC CTG CCT CGG CCT CCC AAA GTG 865 
Ala Arg Ser Leu Asp Leu Val He Cys Leu Pro Arg Pro Pro Lys Val 
270 275 280 

CTG GGA TTA CAG GAC GTG ACG CCC ACC GCC CGG CCT ATT TTT AAT TTT 913 
Leu Gly Leu Gin Asp Val Tor Pro Thr Ala Arg Pro He Phe Asn Phe 
285 290 295 300 

TGT TTG TTT GAA ATG GAA TCT CAC TCT GTT ACC CAG GCT GGA GTG CAA 961 
Cys Leu Phe Glu Met Glu Ser His Ser Val Thr Gin Ala Gly Val Gin 
305 310 315 
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TGG CCA AAT CTC GGC TCA CTG CAA CCT CTG CCT CCC GGG CTC AAG CGA 1009 
Trp Pro Asn Leu Gly Ser Leu Gin Pro Leu Pro Pro Gly Leu Lys Arg 
320 325 330 

* 

TTC TCC TGT CTC AGC CTC CCA AGC AGC TGG GAT TAC GGG CAC CTG CAC 1057 
Phe Ser Cys Leu Ser Leu Pro Ser Ser Trp Asp Tyr Gly His Leu His 
335 340 345 

CAC ACC CCG CTA ATT TTT GTA TTT TCA TTA GAG GCG GGG TTT CAC CAT 1105 
His Thr Pro Leu lie Phe Val Phe Ser Leu Glu Ala Gly Phe His His 
350 355 360 

ATT TGT CAG GCT GGT CTC AAA CTC CTG ACC TCA GGT GAC CCA CCT GCC 1153 
lie Cys Gin Ala Gly Leu Lys Leu Leu Thr Ser Gly Asp Pro Pro Ala 
365 370 375 380 

TCA GCC TTC CAA AGT GCT GGG ATT ACA GGC GTG ACG CCT CAC CCA GCC 1201 
Ser Ala Phe Gin Ser Ala Gly lie Thr Gly Val Thr Pro His Pro Ala 
385 390 395 

GGC TAA TTT AGA TAA AAA AAT ATG TAG CAA TGG GGG GTC TTG CTA TGT 1249 
Gly 

TGC CCA GGC TGG TCT CAA ACT TCT GGC TTC ATG CAA TCC TTC CAA ATG 1297 
AGC CAC AAC ACC CAG CCA GTC ACA TTT TTA AAC AGT TAC ATC TTT ATT 1345 
TTA GTA TAC TAG AAA GTG ATA CGA TAA CAT GGC GGA ACC TGC AAA TTC 1393 
GAG TAG TAC AGA GTC TTT TAT AAC T 1418 



(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
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Met Glu Phe Ser Leu Leu Leu Pro Arg Leu Glu Cys Asn Gly Ala He 
15 10 15 

Ser Ala His Arg Asn Leu Arg Leu Pro Gly Ser Ser Asp Ser Pro Ala 
20 25 30 

Ser Ala Ser Pro Val Gly Trp Asp Tyr Arg Bis Val His His Ala Arg 
35 40 45 

Leu He Leu Tyr Phe Phe Leu Val Glu Met Glu Phe Leu His Val Gly 
50 55 60 

Gin Ala Gly Leu Glu Leu Arg Pro Gin Met He Leu Pro Ser Arg Pro 
65 • 70 75 80 

Pro Lys Val Leu Asp Thr Gly Leu Ser Thr Met Pro Gly Leu Cys Leu 
05 90 95 

Ala Asn Phe Cys Gly Arg Asn Arg Val Ser Leu Met Cys Pro Ser Trp 
100 105 HO 

Ser Pro Glu Leu Lys Gin Ser Thr Cys Leu Ser Leu Pro Lys Cys Trp 
115 120 125 

Asp Tyr Arg Arg Ala Ala Val Pro Gly Leu Phe He Leu Phe Phe Leu 
130 135 140 

Arg His Arg Cys Pro Thr Leu Thr Gin Asp Glu Val Gin Trp Cys Asp 
145 ISO 155 160 

His Ser Ser Leu Gin Pro Ser Thr Leu Arg Ser Ser He Leu Leu Pro 
165 170 175 

Glu Pro Pro Lys Val Ala Gly Thr Lys Asp Met His His Tyr Thr Trp 
180 185 190 

Leu He Phe He Phe He Phe Asn Phe Leu Arg Gin Ser Leu Asn Ser 
195 200 205 



Val Thr Gin Ala Gly Val Gin Trp Arg Asn Leu Gly Ser Leu Gin Pro 
210 215 220 
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Leu Pro Pro Gly Phe Lys Leu Phe Ser Cys Pro Ser Leu Leu Ser Ser 
225 230 23S 240 

Trp Asp Tyr Arg Arg Pro Pro Arg Leu Ala Asn Phe Phe Val Phe Leu 
245 250 255 

Val Glu Met Gly Phe His His Val Arg Gin Val Asp Ala Arg Ser Leu 
260 265 270 

Asp Leu Val He Cys Leu Pro Arg Pro Pro Lys Val Leu Gly Leu Gin 
275 2B0 265 

Asp Val Thr Pro Thr Ala Arg Pro He Phe Asn Phe Cys Leu Phe Glu 
250 295 300 

Met Glu Ser His Ser Val Thr Gin Ala Gly Val Gin Trp Pro Asn Leu 
305 . 310 315 320 

Gly Ser Leu Gin Pro Leu Pro Pro Gly Leu Lys Arg Phe Ser Cys Leu 
325 330 335 

Ser Leu Pro Ser Ser Trp Asp Tyr Gly His Leu His His Thr Pro Leu 
340 345 350 

He Phe Val Phe Ser Leu Glu Ala Gly Phe His His He Cys Gin Ala 
35S 360 365 

Gly Leu Lys Leu Leu Thr Ser Gly Asp Pro Pro Ala Ser Ala Phe Gin 
370 375 380 



Ser Ala Gly He Thr Gly Val Thr Pro His Pro Ala Gly 
390 395 400 
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What Is Claimed Is: 

L A method for detecting the presence of Neural Thread Protein 
(NTP) having a molecular weight of about 8 kDa, about 14 kDa, about 17 
kDa, about 21 kDa, about 26 KDa or about 42 kDa, in a human subject, said 
method comprising: 

(a) contacting a biological sample from said human subject 
that is suspected of containing said NTP with at least one molecule capable of 
binding to said protein; and 

(b) detecting any of said molecule bound to said protein. 

2. The method of claim 1 , wherein said molecule is selected from 
the group consisting of: 

(a) an antibody substantially free of natural impurities; 

(b) a monoclonal antibody; and 

(c) a binding fragment of (a) or (b), 

3. The method of claim 1, wherein the detecting of any of said 
molecule bound to said protein is performed by in situ imaging. 

4. The method of claim 1, wherein the detecting of any of said 
molecule bound to said protein is performed by in vitro imaging. 

5. The method of claim 1, wherein said molecule is administered 
to said human subject. 

6. The method of claim 1, wherein said molecule is bound to said 
protein in vivo. 
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7. A method of diagnosing the presence of Alzheimer's Disease 
in a human subject suspected of having Alzheimer's Disease which comprises: 

(a) incubating a biological sample from said subject which 
is suspected of containing NTP having a molecular weight of about 8 kDa, 
about 14 kDa, about 17 kDa f about 21 kDa, about 26 kDa or about 42 kDa, 
in the presence of at least one binding molecule capable of identifying said 
NTP; and 

(b) detecting said binding molecule which is bound in said 
sample, wherein said detection indicates that said subject has Alzheimer's 
Disease. 

8. The diagnostic method of claim 7, wherein said detection is by 
immunometric assay. 

9. The diagnostic method of claim 8, wherein said immunometric 
assay is a monoclonal antibody-based immunometric assay. 

10. The diagnostic method of claim 7 f wherein said method 
comprises: 

(a) incubating said biological sample with two different NTP 
monoclonal antibodies bound to a solid phase support; and 

(b) detecting NTP bound to said monoclonal antibodies with 
a third different detectably labeled NTP monoclonal antibody in solution. 

11. The diagnostic method of claim 7, wherein said incubating step 
further includes adding a known quantity of labeled Neural Thread Protein 
whereby a competitive immunoassay is established. 

12. The diagnostic method of claim 7, wherein said detection is by 
immuno-polymerase chain reaction. 
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13. A method of diagnosing the presence of neuroectodermal 
tumors in a human subject suspected of having a neuroectodermal tumor 
which comprises: 

(a) incubating a biological sample from said subject which 
is suspected of containing NTP having a molecular weight of about 8 kDa, 
about 14 kDa, about 17 kDa, about 21 kDa, about 26 kDa or about 42 kDa, 
in the presence of at least one binding molecule capable of identifying said 
NTP; and. 

(b) detecting said binding molecule which is bound in said 
sample, wherein said detection indicates that said subject has a 
neuroectodermal tumor. 

14. The diagnostic method of claim 13, wherein said detection is 
by an immunometric assay. 

15 . The diagnostic method of claim 14, wherein said immunometric 
assay is a monoclonal antibody-based immunometric assay. 

16. The diagnostic method of claim 13, wherein said method 
comprises: 

(a) incubating said biological sample with two different NTP 
monoclonal antibodies bound to a solid phase support; and 

(b) detecting NTP bound to said monoclonal antibodies with 
a third different detectably labeled NTP monoclonal antibody in solution. 

17. The diagnostic method of claim 13 , wherein said incubating step 
further includes adding a known quantity of the corresponding labeled NTP 
whereby a competitive immunoassay is established. 

18. The diagnostic method of claim 13, wherein said detection is 
by immuno-polymerase chain reaction. 
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19. A method of diagnosing the presence of a malignant 
astrocytoma in a human subject suspected of having a malignant astrocytoma 
which comprises: 

(a) incubating a biological sample from said subject which 
is suspected of containing NTP having a molecular weight of about 8 kDa, 
about 14 kDa, about 17 kDa f about 21 kDa, about 26 kDa or about 42 kDa, 
in the presence of at least one binding molecule capable of identifying said 
NTP; and 

(b) detecting said binding molecule which is bound in said 
sample, wherein said detection indicates that said subject has a malignant 
astrocytoma. 

20. A method of diagnosing the presence of a glioblastoma in a 
human subject suspected of having glioblastomas which comprises: 

(a) incubating a biological sample from said subject 
suspected of containing NTP having a molecular weight of about 8 kDa, about 
14 kDa, about 17 kDa, about 21 kDa, about 26 kDa or about 42 kDa, in the 
presence of at least one binding molecule capable of identifying said NTP; and 

(b) detecting said binding molecule which is bound in said 
sample, wherein said detection indicates that said subject suffers from a 
glioblastoma. 

21 . A Neural Thread Protein (NTP) substantially free of any natural 
impurities and having a molecular weight of about 42 kDa, 26 kDa, 21 kDa, 
17 kDa, 14 kDa, or 8 kDa. 

22. The NTP according to claim 21, wherein the protein comprises 
the amino acid sequence set forth in SEQ ID NO: 121. 

23. An isolated nucleic acid molecule coding for the NTP according 
to Claim 21. 
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24. The nucleic acid molecule according to claim 23, wherein the 
molecule comprises the nucleic acid sequence set forth in SEQ ID NO: 120 
which encodes the amino acid sequence set forth in SEQ ID NO: 121. 

25. The nucleic acid molecule according to claim 23, wherein the 
molecule encodes the amino acid sequence set forth in SEQ ID NO: 121. 

26. The nucleic acid molecule of claim 23 which is a plasmid. 

27. An expression vector comprising the nucleic acid molecule of 
claim 23. 

28. A host cell transformed with the plasmid of claim 26. 

29. A method of using the plasmid of claim 26 to prepare an NTP, 
said method comprising: 

(a) introducing said plasmid into a host cell to produce a 
recombinant host cell; 

(b) culturing said recombinant host cell; and 

(c) isolating said NTP from said recombinant host cell. 

30. A nucleic acid probe for the detection of the presence of NTP 
in a DNA sample from an individual comprising a nucleic acid molecule 
sufficient to specifically detect under stringent hybridization conditions the 
presence of the molecule according to claim 23 in said sample, wherein said 
probe is nonhomologous to a PIP nucleic acid sequence. 

31. The probe according to claim 30 t wherein said probe is a 15- 
to 30-mer antisense oligonucleotide which is complementary to an NTP 
nucleic acid sequence and which is nonhomologous to a PTP nucleic acid 
sequence. 
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32. A method of detecting the presence of a genetic sequence 
coding for NTP having a molecular weight of about 8 kDa, about 14 kDa, 
about 17 kDa, about 21 kDa, about 26 kDa or about 42 kDa, in a sample, 

which comprises: . 

(a) contacting said sample with the probe of claim 30 under 

conditions of hybridization; and 

(b) detecting the formation of a hybrid of said probe and 

said sequence. 

33. A method of producing an NT? having a molecular weight of 
about 8 kDa, about 14 kDa, about 17 kDa, about 21 kDa, about 26 kDa or 
about 42 kDa, said method comprising: 

(a) culturing a recombinant host comprising a human gene 
coding for said NTP; and 

(b) isolating said NTP from said host. 

34. The method of claim 33, wherein said host is E. coli. 

35. The method of claim 33, wherein said gene is contained by a 

vector. 

36. A substantially pure NTP having a molecular weight of about 
42 kDa, about 26 kDa, about 21 kDa, about 17 kDa, about 14 kDa, or about 
8 kDa obtained by the process of claim 33. 

37. A pharmaceutical composition comprising the probe according 
to claim 31 and a pharmaceutically acceptable carrier. 

38. A ribozyme comprising a target sequence which is 
complementary to an NTP sequence and nonhomologous to a PTP nucleic acid 
sequence. 
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39. A nucleic acid molecule which codes for the ribozyme of 
claim 38, 

40. A pharmaceutical composition comprising the NTP ribozyme 
of claim 38 and a pharmaceutical^ acceptable carrier. 

41. A method for inhibiting the expression of an NTP in a patient, 
said method comprising administering to said patient an effective amount of 
the antisense oligonucleotide of claim 31. 

42. A method for inhibiting the expression of NTP in a patient, said 
method comprising administering to said patient an effective amount of the 
ribozyme of claim 38. 

43. A method for inhibiting the expression of an NTP in a patient, 
said method comprising administering to said patient an effective amount of 
the DNA molecule of claim 39. 

44. An oligonucleotide comprising the sequence 3'X5'-I^5'X3\ 
wherein X comprises an NTP nucleic acid sequence which is nonhomologous 
to the PTP nucleic acid sequence, and wherein L represents an oligonucleotide 
linkage. 

45. An oligonucleotide comprising the sequence 5'X3'-L-3'X5' f 
wherein X comprises an NTP nucleic acid sequence which is nonhomologous 
to the PTP nucleic acid sequence, and wherein L represents an oligonucleotide 
linkage. 

46. A method to treat diseases or conditions mediated by the 
presence of an NTP having a molecular weight of about 8 kDa, 14 kDa, 17 
kDa, about 21 kDa, about 26 kDa or about 42 kDa, which method comprises 
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administering to a patient in need of such treatment an effective amount of the 
oligonucleotide of claims 44 or 45 t or a pharmaceutical composition thereof. 

47. A ribonucleotide NTP external guide nucleic acid comprising: 

(a) a 10-15 nucleotide sequence which is complementary to 
an NTP nucleic acid sequence and which is nonhomologous to the FTP nucleic 

acid sequence; and 

(b) a 3'-NCCA nucleotide sequence, wherein N is a purine. 

48. A method to treat diseases or conditions mediated by the 
presence of an NTP having a molecular weight of about 8 kDa, about 14 kDa, 
about 17 kDa, about 21 kDa, about 26 kDa or about 42 kDa, which method 
comprises administering to a patient in need of such treatment an effective 
amount of the ribonucleotide NTP external guide nucleic acid according to 
claim 47, or a pharmaceutical composition thereof. 

49. A virion comprising the expression vector of claim 27, 

50. A method to treat diseases or conditions mediated by the 
abnonnally low level of expression of an NTP having a molecular weight of 
about 8 kDa, 14 kDa, 17 kDa, about 21 kDa, about 26 kDa or about 42 kDa, 
which method comprises administering to a patient in need of such treatment 
an effective amount of the virion of claim 49. 
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51. A method of differentiating between sporadic and familial 
Alzheimer's Disease in a human subject, said method comprising: 

(a) obtaining a biological sample from said human subject 
who is suspected of having Alzheimer's Disease; 

(b) purifying DNA from said biological sample; and 

(c) contacting said DNA with the probe of claim 30 under 
conditions of hybridization; 

wherein familial Alzheimer's Disease is indicated by the detection of a hybrid 
of said probe and said DNA, and 

wherein sporadic Alzheimer's Disease is indicated by the absence of detection 
of hybridization. 

52. An antibody having binding affinity to the NTP of claim 21 but 
not to a FTP. 

53. A hybridoma which produces the monoclonal antibody 
according to claim 52. 
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1-9o T7 SEQUENCE 

Sequence Ronge: 1 to 1442 

CGCTG CGCCC AGGCT GGCTC TDGAA AGCCT GTGCG GTCCT GGCAG GAAGC CCGGC OCGTG 60 
GAGCA GGTTT TCGTI CTCCT TCAGC MTAA ATAAG GGTGA CCACA GGGAC TTTGC TTTTG 120 
GTTTC CTTTC CTGTG AAAAG GTTGG TTTTA AAGTG AGATA CACTT TTCOG TAGAA CAAGT 180 
GTTCT ATCTT TAAAA ACCCA AATTG CAGCA CCGTG GATTA CTGGT CTCAG AACAA CTCAT 240 
7GCGC ATCAG ATTTC ACTCT CTGAT TTTCT CTCTA TTGGC CAAAT TGCCC TTTAA CTGCA 300 
CCTGA ATCCT T7GTG TACTC ATGCC TTTGA GCTGG GCACC TTGGG AGAGT GTTGT GTTGC 360 
TGTTT ACGGT TCnC CTTCC CCTTG CTAAT TACAG TCTCT GGTGC CCACC AAGCC CCTTT 420 
GGCTT CCTTC GGTGA CTGGT CADGT TGTCT GCCTG GGCTC AGCGT GGACC TGCCC CATGC 480 
TGCA5 AACCT (JfflW CACCT GGACT TTTAC TAGAA TJGCC AGCTT CTCAA CTTAC CAGAT 540 
CATCA CTCAT GCGGG CACAA GCAAA GATCA ACACT TTCTT TTTTG GTAAG CTTGA GTTTT 600 
ACAAG TTATT TTTTG GTGAT GCGTA AGACA TTGCA GTGGG AAACC ATTCA ACTTG AGTTT 660 
ATTGG AGTTT GCTGT TGTAG CACGT TTTAA CTCAG GAACA ACTCT TGTCT GATCT CTCGC 720 
CCCTC TGCCG GGACT ACATT ACTGT CTCTC GGAGC CGGTA GCCTT GCTGT CGAGT CCCAG 780 
GACTA TCTCT GCAGA CTGCT ATGCT CAGAT CGAAG TATTT CACAA GAATA CTTGT GTTTT 840 
TAACA GCCCT TCCCC TGGAC GGTGC GCCAT GAGGG CCTCA TGTTA CGCAT TGGCT TTTCT 900 
TTCTG TGGAT CCAGT ATCTT CCTCG GCTTT TTAGG CAGCA GCAAA AATGC GTCTG AGAGC 960 
AACTC TTTTT AAAAA CCTGC CCTGT TGTAT ATAAC TGTGT CTGTT TCACC GTGTG ACCTC 1020 
CAAGG GGGTG GGAAC TTGAT ATAAA CGTTT AAAGG CGCCA CGATT TGCCC GAGGG TTACT 1080 
CCTTT GCTCT CACCT TGTAT GGATG AGGAG ATGAA GCCAT TTCTT ATCCT GTAGA TGTGA 1140 
AGCAC TTTCA GTTTT CAGCG ATGTT GGAAT GTAGC ATCAG AAGCT CGTTC CTTCA CACTC 1200 
AGTGG CCTCT CTGCT TGTCC ACATG CGCTG GGCGT CTGGA CCTTG AATGC CTGCC CTGGT 1260 
TGTGT GGACT CCTTA ATGCC AATCA TTTCT TCACT TCTCT GGACA CCCAG GGCGC CTGTT 1320 
GACAA GTGTG CACAA' ACTCC TAATT TAAAT GTCAC AGACA ATGTC CTAGT GTTGA CTACT 1380 
ACAAT GTTGA TGCTA CACTG TTGTA ATTAT TAAAC TGATT ATTTT TCTTA TGTCA AAAAA 1440 



Y/P5' SEQUENCE 

Sequence Range: 1 to 313 

GATCC CGTTT GACAG GTGTA CCGCC CCAGT CAAAC TCCCC ACCTG GCACT GTCCC CGGAG 
CGGTC GCGCC CGCGC GACCA CGGAG CTCTG GGCGC CAGAA GCGAG AGCCC CTCGC TGCCC 
CCCGC CTCAC CGGGT AGTGA AAAAA CGATG AGAGT AGTGG TATTT CACCG GCGCC CCGCG 
AGGAC CCCCG CCCGA CCCAG TGCCG AACGG GGG 
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^"1 5 10 15 20 

i M J " CAcc GtgGo TIA-C tgGlO 

HuDon - pip • CATT GCAGC TCAGC ATGGC 

9A*I-T7 25 30 35 40 45 50 55 60 65 70 75 

[ 386 J TOGA oCAoC TCATl gcgCA T-CoG ATlTo CTclC TGAIl mCT GTCTo UgGC CAltO 

Hunorf-PIP TCAGA CCAGC TCATA CTTCA TGCTG ATCTC CTGOC TGATG TITCT GTCTC AGAGC CAAGC 

9A»I-T7 85 90 95 100 105 110 115 125 130 135 140 
I 386 ] CCclt tooCl gcocC IGA-o TcCU IglCl oClGA TCcll TGogC tG— G GCACC -lTG-> 
Hunon-PIP CCAAG AGGCC CAGAC AGAGT TGCCC CAGGC C0GGA TCAGC TCCCC ACAAG GCACC AATGC 

9A»I-T7 145 150 155 160 165 170 175 180 185 190 195 
( 386 J ggAgo G-T-g TlgTG ITgCT -gHT AcgGt icilC cHc cCCTl GcToo IToCA G-TCT> 
Huwm-PTP CTATC GCTCC TACTC CTACT ACTTT AATGA ACACC GTGAG ACCTG GGTTG ATCCA GATCT 

9A*I-I7 200 205 210 215 220 225 230 235 240 245 250 255 
1 386 J CTggT GCCAG cA-4g ccccT UGCC tlCCT tccGT gocTG gTCAC gllGt CIGcc tGgGO 
Hunon-PTP CTATT GCCAG AACAT GAATT CGGGC AACCT GGTGT CTGTG CT Q C CCAGG CCGAG GGTGC 

9A.I-T7 260 270 275 280 285 290 300 305 310 315 

I 386 J CagcG TGCCC cCA-T GcTgc A-CAo ccTGG C-CTc AgCAC TTltc ocT-o GoATT GcCCT> 

Hunon-PIP CTTTG TGGCC ICACT GATTA AGGAG AGTGG CACTG ATGAC TTCAA TGTCT GGATT GCCCT 

9A»I-T7 325 330 335 340 345 350 355 360 365 370 375 

I 386 ] ICcTc A-oCt tAgcA GAtCo HCoC Tcotg CgGGc oCA-a Gcoao goTco oCdCt TlC-t> 

HunotH'TP CCATG ACCCC AAAAA GAACC GCCGC TGGCA CTGGA GCAGT GGGTC CCTGC TCTCC TACAA 

9 J~ 17 . 10 15 20 25 30 35 40 45 

[ 180 ] TT tTCCT AgA-A cA-oG -gCtT cTolC T-TTA A-AA -Accc ooATT> 

H REG GENE TT CTCCT ATAGA GATTG TTGAT TTGCC TCTTA AGCAA GAGAT TCATT 

9o-l7 50 55 65 70 75 80 85 90 95 100 105 110 
I 180 J GCAGC oCcGC tgGIC TCAGA aCAaC TCATl gcgCA T-CaG ATlTo CTclC TGATl TTTCT> 
H REG GENE GCAGC TCAGC ATGGC TCAGA CCAGC TCATA CTTCA TGCTG ATCTC CTGOC TGATG TTTCT 

9o-l7 115 120 125 130 135 140 145 150 155 
I 180 ] GTCTo UIGg CcAoo T-lGc cCT-T TTooC IgCAc ClgAo TCTTT> 
H REG GENE GTCTC AGAGC CAAGG TAAGA TCTCT TTTCC ACCAA CCAAC TCTTT 



FIG. 10 
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9A»I-T7 
[130] 
EX0N2 

9A«I-T7 
[130] 
EX0N2 



5 10 15 20 25 30 
-CAc eCt^J oTlA- CtgCt CTCAG AoCAo> 
CAT IGCAG CTCAG CATOC CTCAG ACCAG 



35 40 45 50 55 60 65 70 75 80 
CTCAT tgcgC AT-Co CATIT oCTcl CTCAT ITTTC TCTCT ottqG CCAA> 
CTCAT ACTTC ATCCT CATCT CCTGC CTCAT GTTTC TCTCT CACAC CCAA 



HUG GENE 

I I* ] 
WP03-4 T7 



5 10 15 20 25 30 35 40 45 

-G oAlTc cTGgg eTCAo CtgAt eCTC- TCoiG coGTC TCC- CA-oA qT-CO 
G GAGTT TTGTC ATCAG GCCAG CCTCA TCCCG AGGTC TCCTC CACCA TTGGC 



HflEG GENE 
[136] 
W03-4 T7 



50 55 60 65 70 75 80 85 95 JOO 

KMT £l£ fi£ 5^ ^ ^ TCAGl TAAAO 
CGTAG CCAGC AGGTT CACIG CTCAC CGAAA CTAAA ATCCC CTCCT TCAGC AAGAA TAAAG 



H RBI GENE 105 110 115 120 125 130 
1136 ] oAAoA oAaAC CTTAo ooH glTAg GcAA- ATA> 
WP03-4T7 CAATATACACCTTAGGnCCACTAACTAACATA 



VSP03 8SP 

[108] 

EX0N2 



120 115 110 105 100 95 90 85 80 75 
CTT-c TgTGo gTCTc AAttt gtlcc TTCIT gGoAC CT-G IcTGG -TqA- AlCtG 
CTTGA TTTGC CTCn AAGCA AGAGA TTCAT TGCAG CTCAG CATGC CTCAG ACCAG 



W03 8SP 70 65 60 55 50 45 40 35 30 

rvnSS J ^S 9 ! ^ l9lCT GcToT lCTGl CTC ~ T cTgTo TGTCT -G-t CCAtG 
CTCAT ACTTC ATCCT GATCT CCTGC CTGAT GTTTC TGTCT CAGAC CCAAG 



FIG. 1 0A 



WO 94/15272 



PCI7US9S/17I11 



13/52 



JB-4-296 260 255 250 245 240 235 230 

I , ! 2 / J — " <ACIA CcoAo C-CT- -OCtot IA-AA -«Ao tttCg gUG- -€tCg 

WSyM ACCA CGGAG CTCTC GGOGC CAGM GOGAG AGCCC CTCGC TGCCC 

AD3-4-296 225 220 215 210 205 200 195 190 185 180 175 170 

[ 1)2 J <aCWC -ggAg C-«Go AcccA Acctc CGAgc AGtac A-TGc TAogo CtlCo cCoGl CoaoG 

«P5 4/93 CCCGC CTCAC 0GGGT AGTGA AAAAA CGATG AGAGT AGTGG TATTT CACOG GOGGC OOGCG 

AD3-4-295 165 160 155 150 145 140 135 130 125 120 115 

[ 112 ] <-cGA- oCgta ClotA CtCAo T-IG AlCco otoAc UGoC CoocG Gooco AgTTA ccCTA 

W5 4/93 AGGAC COOCG CCGGA CCCAG TGGGG AAOGG GGGAG TAGTC CCGGG GGCTC ACTTA TTCTA 

AD3-4-296 105 100 95 90 85 80 75 70 65 60 55 50 

1 112 J <tAocA G-CgC Aotcc lAttC TAGAC TCcAl oTCAA CAGGG TIToC gocCl CGoTG -TTgg 

«P5 4/93 CATTA GTCTC ACGTG CAGAC TAGAG TCAAG CTCAA CAGGG TCnC TTTCC CGCTG ATTCC 



AD3-4-296 - 45 40 
(112] XotCAgCoC 
WP5* 4/93 GCCAA GTC 



AD2SP6F 10 15 20 25 30 35 40 45 50 55 60 65 

[ 504 ] AG-Tl TCoCT CTGTt gCCCA CGCTG gAGTC CAaTC CCoCA ATCcl GGCTC ACTGC oAcCT> 

1-9AT7-3 3 AGATC TCGCT CTGTC ACCCA CGCTG AAGTC CAGTG GCCCA ATCTC GGCTC ACTGC GAGCT 

AD2SP6F 70 75 80 85 90 95 100 105 110 115 

I 504 ] CCgCC TCCCG oGcTC Aogco ATTCT CCTCC CTCA- -GCCT C-G- — TGA GccGC TGGGA> 

1-9AT7-3 3 CCACC TCCCG GGTTC ACTTC ATTCT CCTGC CTCAC TGCCT CAGCC TCTGA GTAGC TGGGA 
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HB4-SEQ SEQUENCE 

GAGGC GTATT ATADC ATCCT CCATC TGCCT AOGAC AAACA GACCT AAAAT CGCTC ATTGC 60 
ATACI CTTCA ATCAG CCACA TAOCC CTCCT AGTAA CAGCC ATTCT CATCC AAAOC CCCTG 120 
AflOCT TCACC OOCGC AGTCA TTCTC ATAAT OGCOC ACGOC CTTAC ATCCT CATTA CTATT 180 
CTGCC TAGCA AACTC AAACT AOGAA CGCAC TCACA GTCGC ATCAT AATCC TCTCT CAAGC 240 
* ACTO AAACT CTACT CCCAC TAATA GCTTT TTCAT GACTT CTAGC AAGCC TCGCT AACCT 300 
OCOCT TACCC CCCAC TATTA ACCTA CTGGC AGAAC TCTCT GTGCT AGTAA CCACG TTCTC 360 
CTGAT CAAAT ATCAC TCTCC TACTT ACAGC ACTCA ACATA CTACT CACAG CCCTA TACTC 420 
CCTCT ACATA TTTAC CACAA CACAA TGGGG CTCAC TCACC CAGCA CATTA AGAAC ATAAA 480 
ACCCT CATTC ACACG AGAAA ACACC CTCAT GTTCA TACAC CTATC CCCCA TTCTC CTCCT 540 
ATCTC TCAAC CCCGA CATCA TTACC CGGTT TTCCT CTTAA AAAAA AAAAA AAAA 590 



H84 PROTEIN 

EAWT MLHLP TTNRP KIAHC IIFNQ PHSPR SNSHS WNPL KLHJR SHSHN RPRAY ILITI 60 
LPSKL KLRTH SOSHH NPLSR TSNST PTNSF LMTSS KPR g 5 
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I 440 J i T.r. 

m 

HB4-SEQ 15 20 25 30 35 40 45 50 55 fin m 
[ 440 J C-AT- GCTCC olCTC C-CT ACgoc M*e AGACC -T-A- AooTc GcTco ITGCA tJl 
Hun,on-PTP CTATC GCTCC TACTG CTACT ACTTT AAIGA AGACC GTGAG AttTC GGTTG HEm CAW 

A L Hf5 ° lC/>C ^ Clcfr - W 3to acoG- CcoTt CTCAt CCAao CCccc tGo fl0 > 
Hunon-PTP CTATT CCCAG AACAT GAATT OGGGC AACCT GGTGT CTGTG CTCAC CCASC CttS GC?cT 

7^?? l Sr 130 135 140 145 150 155 160 165 170 175 

Ln ™° ccGge cATT- ctcAt AoTcG C-Cco cgGgC TTocA T-cCT -cAN aclnT* 
»WPTP CTTTG TGGCC TCACT GATTA AGGAG AGTGG CACTG ATCAC TTCM Tg£t mr ODCT 

IwJtp ™? 2££ ^ Ct cAAaC lo0Go ocGCA CT ~ cA ^ CoToo TCTCl clCAA> 
HumarH>TP CCATG ACCCC AAAAA GAACC GCCGC TGGCA CTGGA GCAGT GGGTC CCTGG TCTCC TACAA 

ffj?? 2 ?^, 245 250 255 265 270 275 280 285 290 295 300 305 
I At* 952 - lco ° AcTcl Actt * CAAGC UIGT gAcTl CToGC oACct cGctA oCCTc oCCTt> 
Hunon-PTP GTCCT GGGGC ATTGG AGCCC CAAGC AGTGT TAATC CTGGC S TGTGA SS 

H4in 2L2?.An '^t2° 340 345 350 355 360365 370 

i J™ CcCc TTooc clAcT GGgAG oATGT G-CTo GT-AA -cCAc GTTCT CCTTe onnTn-y 
Humon-PTP AAGCA CAGGA TTCCA GAAAT GGAAG GATGT GCCTT GTGAA COA GTTCT 5n £$? 

IrSf? . 375 380 385 390 395 400 405 410 415 420 425 
I 440 ] tcAcT ctcct ActTA cAGG- A-CT- cAAcA TACto GTCcA GccCT -AToP Irfri Sacan 
Hunon^TP CAAGT TCAAA AACTA GAGGC AGCTG GAAAA MOT OKU «*T GATO ^ w£ 

U^fl ?n, 435 r/J? 445 450 455 460 465 ^0 475 480 485 
LUjo ISi! JJ? C MC0C Mt « aclc MM oXcA C-CAc oTtAA CCato AoACc CTCoT> 
Humon-PTP ACGGA GTCAA AAATT AAACC GGACC ATCTC TCCAA CTCAA CTCAA CCTGG ACACT CTCTT 

F2m A rn 495 500 505 5,0 5,5 520 525 530 540 545 
i J TO 0cCAG ~ floo ° Coccc TcAT 9 TTC-A TAcoc cTA— TcCCC CAITC TTccl AlCCo 
Humon-PTP CTCTG CTGAG TTTGC CTTGT TAATC TTCAA TAGTT TTACC TACCC CAGTC TTTGC AACCT 

r^f? 550 555 560 565 570 575 580 585 590 

A L , CCCgA cAlcA AcC 99 GTTTC ClCT l AAAAA AAAAA AAAAA A> 
Humon-PTP TAAAT AATAA AAATA AACAT GTTTC CACTA AAAAA AAAAA AAAAA A 
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H REG GENE 2260 2265 2270 2275 2280 2285 2290 2295 2300 2305 

I 284 J C tTcTT -TtlC AgGC- CdAgo gGCCc A-GAC AgAgl tgCC- ccAgg CcCgg ATcog> 

W4-SEQ C GTATT ATACC ATCCT CCATC TGCCT ACGAC AAACA GACCT AAAAT CGCTC ATTGC 

H REG GEN2310 2315 2320 2322330 2335 2340 2345 2350 2355 2360 2365 2370 
L 28 U cTgCc ^ ^ TAIC 9 CTCcT AcT-g CtoCl AcTtT oATgo AgAOC gCgoO 
HB4-SEQ ATACT CTTCA ATCAG CCACA TAGCC CTOGT AGTAA CAGCC ATTCT CATOC AAACC COCTG 

H REG G8NE 2375 2380 23852390 2395 2400 2405 2410 2415 2420 2425 2430 
I 284 ] -AcCT gggtt GotGC AGTgt gogTg AggAg cGCgl gtGGG oaggg AgoCT CATgA -oggg> 
«4-SEQ AAGCT T QCC GGCGC AGTCA TTCIC ATAAT CGCCC ACGGG CTTAC ATCCT CATTA CTATT 

H REG GENE 2435 2440 2445 2450 2455 2460 2465 2470 2475 2480 2485 

.L 2 2J 09G " aACC " ^ ActCT ^ lGl 9t TCAgt GgCGC Aolgo gAT-g agoCT qAAcO 
JB4-SEQ CTGCC TAGCA AACTC AAACT ACGAA CGCAC TCACA GTOGC ATCAT AATCC TCTCT CAAGG 

H REG GENE 24902495 2500 2505 2510 2515 2520 2525 2530 2535 2540 

L. LI cCTTl AlACT oTcaT ^ ccA " A aCTTT CCflAT — CTo CT~t lAtCC -ColT AttCo> 
HB4-SE0 ACTTC AAACT CTACT CCCAC TAATA GCTTT TTGAT GACTT CTAGC AAGCC TCGCT AACCT 

H REG GENE2545 2550 2555 2560 2565 2570 2580 2585 2590 2595 2600 
I 284 j gcoCo TICCC ogCAC oAogA ACCTg gTGGG IG-AC ogcoT cotC- AcggA Cotto cTCTq> 
W4-SEQ CGCCT TACCC CCCAC TATTA ACCTA CTGGG AGAAC TCTCT GTGCT AGTAA CCACG TTCTC 

HREG GEN2605 2610 2612620 2625 2630 2635 2640 2645 2650 2655 2665 2670 
I 284 ] CTG-T CcttT tTCAC cCTCC T-CTT ggAGC ACTCA gtATA tccGT CACAa CCCTc cACTg> 
HB4-SEQ CTGAT CAAAT ATCAC TCTCC TACTT ACAGG ACTCA ACATA CTAGT CACAG CCCTA TACTC 

H REG GENE 2675 2680 2685 2690 2695 2700 2705 2710 2715 2720 
l 284 ] ogTCT cCAT- TH-C ItC- IgCM co-G CTClo T-lgC CAgaA CATgA A-UC gggcA> 
HB4-SEQ CCTCT ACATA TTTAC CACAA CACAA TGGGG CTCAC TCACC CACCA CATTA ACAAC ATAAA 

H REG GENE 2725 2730 2735 2740 

[ 284 ] A-CCT -ggTg IC-IG IG— c tCACC O 

W4-SEQ ACCCT CATO ACACG AGAAA ACACC C 
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AD2-2 T7 



CTTCT TAGTC TATCT CTTGT ACAAA OGATG TGCTT TGAAG ATGTT AGTGT ATAAC AATTG 60 

ATGn TGTTT TCTGT TTGAT TTTAA ACAGA GAAAA AATAA AAGGG GGTAA TAGCT CCTTT 120 

mcr Tcm ctttt ttttt ttcat ttcaa aattg ctgcc agtgt tttca atgta ggaca m 

ACAGA GGGAT ATGCT GTACA GTGTT TTTAT TGOCT AGnG ACAAA GCTGC TTTTG AATGC 240 

TGG1G CTTCT ATTCC TTTGC ACATC ACGAC ATTTT ATAAT CATAG TTAAA TCGTA TATGA 300 

CAAAA ATGCT CTGAT CTGAT GCCAA AGGTC AATTC AGTGT ATATA ACCTG AACAC ACTCA 360 



»fVFC LILNR EKIKG GNSSF FLLSF FFSFQ NCCQC FOCRT TEGYA VECFY CLVDK AAFEC 60 



TCCAT TGCGT TT 



372 



A02-2 T7 PEP 



WWFYSFDT 



FIG.16A 
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AD2-2 SEQUENCE 



?S USS ATC TCnGTACAA ™AAGATGT 60 

TAGTGTATAA CAATTGATGT TTGTTTTCTG TTTGATTTTA AACAGAGAAA AAATAAAAGG 120 

S£™ TTCrTTCm ^AGAAAMA S £ 

Sl^^™™^ 240 
t/ww GACAACAGAG GGATATGCTG TAGAGTGTTT TATTGCCTAG TTGACAAAGC 500 

see 11 SEE™ » 

TGCTATAGGA CAAGATGCTC TGATTCCTGA GTCCCAGAGG TTCAATTCAG TGTATATAAC 420 

IG^CAcr mourn tgctttictt ttttttatgg tggcttaaag gtaaagagcc m 

AAGAATGGTG GCTTGTTTCA TGGTTTTTGT ATTTGTGTCT AATGCACGTT TTAACATGAT fiM 

SISflF™ MTAAATTTT TTCTTTAAAT TTCAAAGAAC AATGTGCTTG TGTTGATGCC 720 

G T ~ r ™\ T HGTGTATTC CTTCTTGTAT TTAGACAGTG ttS 76 

SESS SP*™ TGGMTGAGA ™ mm TTGGTACGTA 840 

EES™ 6TGTCTTGAC TCTTAGCTAA AAAGTTAGAA 900 

SUEF TATTTT OCI AAATTATTAC TIACTCTGAG CATTAATTAA 960 

S? 5S52f ^° TC mTTmc TGrcTATAAT rcmAcrn 1020 

S TCTAGAAAC ATGAGCCAAA AATGTCAATA GACAACACAG TATTAAAATA 1080 

S JESS" jMttinCIC "C™^ ^ACACTM S HO 
J?* 10 ** 5 ACGTTAAGAA TTAAAATGCG TCTGGGTGCG GTCTCACGCT 200 

52515? W™ 1 * ^GAGGTCA SS 6^ 

ACOGCCTGA TGAACATGGA CAAACCCCAT CTCTACTAAA AATACAAATA TTAGCTGOBC So 

2 ™tc ccagoggctc aogaggctga S S \m 

SHE gggg JSSffi SEES TGWCTCCAG ™«" 5 
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AD3-4 SEQUENCE 

ttggglgoggtqqo 

ATCAT OGTCT GTTGG AATDG GTTTG CTAAA TGGCT TTATT TCATA TCOGC TATCT TTAAC 60 
TTTGG AOCGC GTTAT CTATA TCATG GCGTT CCTTT CTACT TTTTA ATATI GGTTC GTATT 120 
ATATC GTTCC TGATT GGGGA TA1GG AAGAC GTATT ACTTA ATTGT ACTTT ATTGA AACGT 180 
TCCTC TCGGT TTCGA TTCTG GGGGC TTTGG TCTGC TCGAT GGATT CTTGT CGATT TTCTC 240 
GTGTG GCAGT AACAT ACCGT TTTAT CACCC TTCTA AATAT CCCAT CTCCC GCTCT TTGGT 300 
AGGCT CGGAA CACTA TCGAC CAACA GGnC TATCT AGAAT CAAGT TGGAA ATTAA ACGGT 360 
GTCH CG 367 

A03-4 PROTEIN 

WWCW NRFGK WYFI SAIFN FGPRY LYH3V PFYFL ILVR! ISFLI GOtCD VLLNC TLLKR 60 

SSRFR FWGAL VCSM) SCRFS RVAVT YRFIT LLNIP SPAVW MARNT I0QQV LSRIIf LEIKR 120 

CL ^2 



FIG.16F 



AD3-4T7 

CCCAC AGGTC CTAAA CTACC AAACC TGCAT TAAAA AATTT CGGTT GGTCG ACCTC GGAGC 1180 

AGAAC CCAAC CTCCG AGCAG TACAT GCTAA GACTT CACCA GTCAA AGCGA ACGTA CTATA 1240 

CTCAA TTGAT CCAAT AACTT GACCA ACGGA ACAAG TTACC CTAGG GATAA CAGCG CAATC 1300 

CTATT CTAGA GTCCA TATCA ACAAT AGGGT TTACG ACCTC GATGT TGGAT CAGGA CATCC 1360 

CGATG GTGCA GCCGC TATTA AACGT TCCTT TGnC AAACG ATTAA AGTCC TCGTG TCTGA 1420 

GTTCA GACCG AAGTA ATCCA GGTCG GTTTC TATCT TCTTC AAATT CCTCC CTGTA CCGAA 1480 

AGGAC TAATG AGAAA TAAGG CCTAC TTCAC AAAGC GGCCT TCCCC CCTAA TGATA TCATC 1540 

TCAAC TTAGT ATTAT ACCCA CACCC ACCCA AGAAC AGGGT TTGTT AAAAA AAAAA AAAAA 1600 
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AACCCACTCC ACCTTACTAC CAGACAACCT TAGCCAAACC ATTTACCCAA ATAAAGTATA 60 

GGCGATAGAA ATTGAAACCT GGCGCAATAG ATATAGTACC GCAAGGAAAG ATGAAAAATT 120 

ATAAOCAAGC ATAATATAGC AAOGACTAAC CCCTATACCT TCT6CATAAT GAATTAACAT 180 

GAAATAACTT TGCAAGCAGA CCCAAAGCTA AGACCCCCCA AACCAGACGA GCTACCTAAG 240 

AACAGCTAAA AGAGCACACC GTCATTGTAT GGCAAAATAG TGGGAAGATT TATAGGCTAG 300 

AGGGOGACAA ACCATCCGAG CCTTGTGATA GCTCGTTGTC CAAGATAGAT CTTAGTTCAA 360 

CCTTTAATTT GCCACAGAAC C 381 

FIG.16J 



ADJ-4T7 SEQUENCE 

TTTTTTTTTT TTTTTAACAA ACCCTGTTCT TGGGTGGGTG TGGGTATAAT ACTAAGTTGA 60 

GATGATATCA TTACGGGGGA AGGCCGCTTT GTGAAGTAGG CCTTATTTCT CATTAGTCCT 120 

TTCGGTACAG GGAGGAATTT GAAGAAGATA GAAACCGACC TGGATTACTT CGGTCTGAAC 180 

TCAGACAOGA GGACTTTAAT CGT7TGAACA AACGAACCTT TAATAGCGGC TGCACCATCG 240 

GGATGTCCTG ATCCAACATC GAGGTCGTAA ACCCTATTGT TGATATGGAC TCTAGAATAG 300 

GATTGCGCTG TTATCCCTAG GGTAACTTGT TCCGnCGTC AAGTTATTGG ATCAATTGAG 360 

TTTAGTAGTC CGCTTGGAGT GGTGAAGTCT AGAATGTCCT GTTCGGGGGT TGGTTTCTGC 420 

TCCCAGGTCG CCCCAACOGA ATTTTTTATT GAAGGTTGGG TAGTTTAGCA CCTGTGGGTT 480 

GGTAAGGTAC TGTTGGAATT AATAAATTAA AGCTCCATAG GGTCTCCTCG TCTTGTTGTG 540 

TAATGCCCCC CTCTCCACGG GAAGGTCAAT TCCACTGGTT AAAAGTAAGA GAAAGCTGAA 600 

CCCTCGGGGA GCCATCCATA CAGGTCCCC 629 

FIG.16K 



AD4-4 SP6 SEQUENCE 

Sequence Ronge: 1 to 256 

GCGGG TAAAT TGGTT TGTTA TTTTT TAAAA AAAAC TTGCA TGTTT AAAAA AAAGT TGATT 60 
GCTTC AAATT TCTGC TACTA ACTTC AAGCT ATGGG AGTTT GGCAG TAGTC ACTTG AGGAT 120 
TTTTT TTCCA ATTCT TTTCT TTTTG TTGTT AAAGC TGTAC TTCAG TGAAC AGAAA AATTG 180 
CCAAG CAAAC TAATG GACTA TAAAG CGTAA TTTGA CTCTG TGGGA CTAAA CTACA GAGCC 240 
TACIT GACCA GTGGA T 256 
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A04-4 T7F SEQUENCE ° /iiC 

Sequence Ronge: 1 to 270 

CATGT TTAAA AAAAA CTTGA TTGCT TCAAA TTACT GCTAC TAACT TCAAG CTATG GGAGT 60 
TTGGC AGTAG TCACT TGAGG ATTTT TTTTC CAATT CGTTI TCATT TTTGT TGTTA AAGCT 120 
CCTAC TTCAG TGAGA CAGAA AAATT GCCAA GCTAA ACTAA TGGTC TATAA AAGCC TAATT 180 
TGCAT GTGTG GCCAA AAACT ACAGA GCCTC AATTG CCACT GAGGT ATAGT ACAAA GTTTT 240 
AATAC ATTTT GTAAA TCAAA TTGAA AGAAA 270 

FI6.16M 



A04-4 SEQUENCE 

CATGTTTAAA AAAAAGTTGA TTGCTTCAAA TTACTGCTAC TAACTTCAAG CTATGGGAGT 60 
TTGGCAGTAG TCACTTGAGG ATTTTTTTTC CAATTCGTTT TCATTTTTGT TGTTAAAGCT 120 
GGTACTTCAG TGAGACAGAA AAATTGCCAA GCTAAACTAA TGGTCTATAA AAGCGTAATT 180 
TGCATGTGTG GGCAAAAACT ACAGAGCCTC AATTGCCACT GAGGTATAGT ACAAAGTTTT 240 
AATACATTTT GIAAATCAAA TTGAAAGAAA 270 

FIG.16N 



AD16c-T7 SEQUENCE 

TCTCC CCAGG CTGGT CTGAA ATTCC TGGGC TGAAG TGATC CTCCA GTCTT GGCCT CCCAA 60 
AGTGC TGGGA TTACA GGCAT GAGCT ACTGA GCCTA GCCTT AATGA TTAAT TTTAG AGTGA 120 
TGGCT TGTAC CTTCA AGACA CATAT AGATT GAGAC AGAAA ATTTC CATCG TCCCC GAGM 180 



ADI6c-T7 PEP 

5 10 15 20 25 30 35 40 45 50 
SSSLG IPKCW DYRHE LLSLA LMINF RVMAC TFKQH 1ELRQ KISIV PRKLC OGPV CPVKI 



55 60 



ALLTI NGHCT WLPAS 



65 70 75 
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AD10-7 SEQUENCE 

2SS52? C £ AACCT0C GCCTCCCGGC TTCAAGOGAT TCTCCTGCCT CAGCCTOCCC 120 
AGTAGCTGGG ATTACAGGCA TGTGCAOCAC GCTOGGCTAA TTTTGTATTr TTTTTT/GTA 80 
™AT GTTGGrCAOG CTGGTCTCGA ACtS S5S 240 
JCCCGTCTCC GCCTGCCCAA AGTCCTCACA TTACA6GCAT GAGCCACCAT CCCOXmc 300 
2JSS CTA ATTTTTCTGG T ^GG CTTTCACTCA SS OKTCTCC 3£ 
££££ SSP52 TOX^SgtS «0 

EES EES? £2!"™ AGACACAGGT GTACCACTCT TA00CAGGAT 3 
SSXF aaCTB * T » CAGCTCACTG CAGCCTTCAA CTCCTGAGAT CAAGCAATCC 540 
™" 5™ JSgW CAAAGACATG CACCACTACA SSS 6^0 
JCfiK ESSE US*** mCJCfC1C TGTCA0CCA5 GCTGGAGTGC 660 
SESS f?**™ TGCCTCO0GG GTTCAAGTTA TTCT CCTGCC 720 

S JESSS" camc ^ aTw;c ™ T TTTTTTGTAT 

SIS SS5SJI £SEf SI TC GCCAGGTTGA TCTIGATCTC TCGACCTTGT 840 
crSJ S ^ TGC T0GGATTACA CCACGCOGGC 900 

TifJII 1 ™ TTTTTGTTTG TTTGAAATGG AATCTCACTC TGTTACCCAG GTCGGAGTGC 960 

£™ gggg ^ TCC0GGG Tc ™ g ™ S» 

5SrS? JIJSP* TGCACCACAC a**™" TT7CTATTTT 1080 

^™5E ^ACCA TATTTGTCAC GCTGGGTCTC AAACTCCTGA CCTCAGGTGA 1140 
CCCACCTGCC TCAGCCTTCC AAAGTGCTGG GATTACAGGC CTGAGCCACC 200 

TA I GTAGCAA TGGGGGTCTG CT ™ S 12W 
TCAAACTTCT GGCTTCAGTC AATCCTTCCA AATGAGCCAC AACACCCAGC CAGTCACATT So 
TTTIAAACAG TTACATCTTT ATTTTAGTAT ACTAGAAAGT mSS in 

1381 
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AD16c-SEQUENC£ 

CCATTGTTAG GTTGTCTCTT ACCTGTTAAA ATCAGGAGCT GACAAGAAAT GCTTACCACA 60 

AAAGGAGAAA TGGCAGTCTA GTTAACAGTC AAGGAGAGAA ATCAGGAAGA TTATGTGGGT 120 

GGAAGAAGTA GATGATGTGG CTGATGAGTG AGTGAGTGAG CAAGCCTCOG CCCAGCTGAA 180 

CAAGGAGTCA GAACTGCCCT TTGTTOCCAA CTATTTGGCG AACOCCAGOC TTCCCTTTTA 240 

TCTATACACC CACAGCAGAG CATTCACCCC AGATGCACAA TGGGGCCCCC TCCACACCCC 300 

CTGCATCAOC GCCTGCAGAT GGCTCACCTC CAnGCnOC CCCTGGGAAC CTCCCCTGTT 360 

AGGGACCm OCCCGCGACC ACACCTCTTT GGCACTAGTT CAGAATGGTG ATGTGTCGGC 420 

CCCTCTGCCA TACTAGAACA CCAGAAAGAC AAAOGGGTGA TGTTTGTCAG CTACAGTGAG 480 

TCTAGAGCCG TCCTGTTTTC TTCTGTCCCG TCOCAAGCCA CCATGTCTCI TCGAGCCTCA 540 

AAATGGGACG TATGCAGGAC CAGCGCCCAG ATTCCAAGCC ATTTTTCTTC ACTGGAGCAT 600 

TTCCATTTM TATGCAAGAG CTGGTACTCA AGGTGAGAAT TCAGAACCCA TCTCTTCGAG 660 

AAAATGATTT CATTGAAAJT GAACTGGACC GACAGAGCTC AOCTACCAAG AGTTGCTCAG 7?0 

SIIS TGTGAGCTGG GTGTTAATCC AGATCAAGTG GAGAAGATCA GAAAGTTACC 780 

SSJEP TTAAGGAAGG ACAAGGATGT TGCTCGACTC AAGATTTCAG GAGCTGGAAC 840 

ISSSil GATAGTGAAA ATAATTTTCT GTTCACAAAT GCTGCATCAC ACTGACTGAA 900 

AGGCCTTGCT ATACAGGAGA GCTTCAAAAC TGACTTACTA ATGCAGCAGG GACTTTTATA 960 

S2SS2S GtCHSKfK ATCACCTCTG GGCCAAGGAC AAGGCATGAT CTAAATGCCT 1020 

CAGATGCCCG GGCCAGTCTG GTGCACTGCA TAGTATATAC GAACATCATT CTGCCCAAGG 1080 

1^^° CATGACCCCC AAGCAGTGGT GTCCACTCTT CCAACCCTCT TGGTGCACAA 1 140 

I^IIf ¥ cmMSC TTTGAAOGAC TGTGAGAATG GTCTGGGGAG GAOGAGAACG 1200 

JESSES IS! 0101011 "GTATCCGA GAATGTAGAG AGTTCTCTGA AGACGACGAC 1260 

5?^™ TTCTAGCCAC TCCTGTTGAC AGTGCACCIG AAGGGCTGGG 1320 

fj 10 ™™ CA^CACA ACTCTCCTGA CATTGGGAAC TTATGAGAGA 1380 

CGAAGACTOG GGAAAGCACA CATACTGGAC AGATGGATTC TGGTGTGGGG AAAGCACAGA 1440 

liSSSJ ATGGTTCTAG TGTGACTTGT GACTGTGAGG TTTCCTATAA CATATTTATA 1500 

MTGTTCATC AGGTTCAAAA GTCTATAAGA ATACAGTTCG AGACTGAATT GCTTCGAMT 1560 

SS! GT ^GGGAACCAA AAGAGCTTTC CCTCCCTCAC TTTTTCCTTT GTAACACTCA 1620 

ESS?? TCTGTCTCGA GTCATCTCTG CATTAACTCC CCTTCGTGGT CACTAGAGGG 1680 

CTTCTAAGAC ACTGCTTm ACATGOCACA CCCACOGCGT AGAGACAGGG 1740 

TCTCACTATG TGGCCCAGGC TGGTCTCAAA CTTCTGGGCT TAAGTGATCG TCCTGTCCTT 1800 

552SI? GA ^**BTOT GGGGATTACA GGTGTGAGCC ACCCGCOCAG CCCCTCCCTT I860 

GTGTTTCAAC CAATCGGAAG TGAATTTAAC TAGATGTAGT AAOCTTTTTT TTCTTTGACT 1920 

Z^f^ TTACAGTTTA CTAATAAAGT TAAGTCTGGT TCTGTCCTAG AGGAAATAAA 1980 

HSfSP ATTCATGTCT TAAGTTACTT GGGTTAAAAC ACTTTCAGOC ACCCAGATTA 2040 

ATTAAAGTGG AGCAGTGGAG CCCCTGGCTG GGGAGATGGG OCTCCAGAGG AGCAGCTGCA 2100 

GGCATCTTCT GGCTACACAG AGGCAAGCAA GGGACTGGTG TCTCTGGTGA GAGGTGGGTT 2160 

TGATGTATCT CTGTCCTATG CTGGTCTCTC TTCTCCTTTA TAAATCCTCC TGTGGTCACT 2220 

GACTATOGTA TCGCAGTGAT CAGACTGCAC ATAGTACGGT TAGGCTGAGC TTAATGTCTT 2280 

MTCATGTCA TTOGAGAGAA GACACGTTTT GATTCATGCT TTGTGTAATT AATCAATCAA 2340 

GGATTCTTTT TTTAGCTTTG TTGACGTGTA ATTCACCCCT CCTCCTCCAC TGCATATTTA 2400 

AAGCATGTGT TCACACTGTG TGTATACATT CACTGOGATT TTTTOGTTTG CTGCATTGCT 2460 

TGGACTGTTC ATAACATCAC AAGTATTATT CAAATAAAAT ATTAACTGAC CGAAAAAAAA 2520 
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AD2-2B3 
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5 10 15 20 25 30 35 40 45 50 
-€A AtTCC TGggC TCAaG TGATC CICtC otgTC AGICT CCCAA AGTGC TGGGA> 
GA ACTCC TGAOC TCAGG TGATC CGCCC GCCTC AGCCT CCCAA AGTGC TGGGA 



H REG GENE 55 60 65 70 75 80 85 90 95 100 

LSJ l 9fa 9X11 ^ ^ ^ 9CCCA -TC- AtCo- G— IT UToT A-toA> 
AQ2-283 TTACA AGOGT GCAGA CCGCA CCCAG AOGAT TTTAA TTCTT AACNT GCCAT CCTCT ACCCA 

H REG GENE 105 110 115 120 125 130 135 140 
I 220 J AGaAo AaAoo ACcTT AoooT tgtTA gCAAA tocto tGACA> 
AD2-283 AGTAC AGACC ACTTT AGTGT CCCTA TCAAA GGGGG AGACA 



FIG. 17 



AD2 SP6F 

[62] 

EXONI 



110 115 120 125 130 135 140 145 150 
— -AA ttC-t C-CtG cCTCA GCClc gtGog ccGcl GGgAT TACAG GcG> 
AA CCCAA CTCAG ACTCA GCCAA CAGGT AAGTG GCCAT TACAG GAG 



RAT PTP 605 
[ 144 ] <ACTC 
A02-2 T7 ACTC 



RAT PTP 660 655 650 645 640 635 630 625 620 615 610 
LI J L <lcT_0 9goA 9 oGggg GnGA C— I ITGCT TTTGA toGoT GGT-c TaqT- TTCac TTlta 
AD2-2 T7 AGTGT TTTAT TGCTA GTTGA CAAAG CTGCT TTTGA ATGCT GGTGG TTCTA TTCCT TTGAC 



RAT PTP 
[ 144 ] 
A02-2 T7 

RAT PTP 
[144] 
AD2-2 T7 



710 705 700 695 690 685 680 675 670 665 
<oCA-T oCAAt AoTGg oGolA -oooo ToCcA T-A-G GgCAg T— GA GGcA- AgooT GTTIG 
TCAn TCAAA ATTGC TGCCA CTGTT TTCAA TGATG GACAA TCAGA GGGAT ATGCT GTTAG 

745 740 735 730 725 720 715 
<GTg A-Agg toTTT TTolT ToooT gTgco qgqTT 
GTA ATACT CCTTT TTTCT TCTTT CTTTT TTTTT 



HPTPAA 



5 



-FVf cLII> 
FVA SLIK 
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AD2-283 ACAAC COXA GCTAA TATTT GIATT TTTAC TAGAG ATGGG GTTTC TOCAT GTTCA TCAGG 

flS;? 3 15 20 25 30 35 40 45 M 55 60 65 70 
L?5J 01051 ^ AlTCC TC 99C TgAoG 1GATC ClCCo GlCTl gGOCT CCCAA AGTGC TGCCto 
A02-283 CTDGT GTCGA ACICC TGACC WCG TGATC CGOOC GCCTC AGCCT COCM AGTGC TGGW 

l-9o-T72 3 140 145 150 155 160 
I 260 J AG-AC AcA-l A-Tog AlTGo goC-A goAAA> 
AD2-28J AGIAC AGAOC ACTTT AGTGT CCCTA TCAAA 

FIG.17B 



f^f 6 5 10 15 20 25 30 

I? I L — AGA- TcTCg CTC-T G-Tco CCCAG GCTGo ACTGO 

W* 6 ACAG TTTCA CTCTI GCTTG CCCAG GCTCG AGTGC 

f^f 6 40 45 50 55 60 * 70 75 80 85 

m » fiS £2f 5S SJ £® 9 AgCTC *** TCCCG S 0110 Acllc *™ CCTGO 
^GG CACAA TCCIG GCTCA CTGCA ACCTC CGCCC TCCCG AGCTC AAGAA CTTCT CCTGC 

mSm" } Z 105 1,0 1,5 120 125 130 135 140M5 150 155 160 

i, CTCAC aTC " TGAGt 00010 ACAGG CGCcC oCCAC AcGCc qCTAA Ttm GTATT> 

AD2-2SP6 CTCAG CCrCG TGAGC CGCIG GGATT ACAGG CGCGC GCCAC AAG0G ACTM TATTT GTATf 



AD16c-SP6 

I 344 ] TTTGT AO 
A02-2SP6 TTTGT AG 



AD16c-SP6 140 145 150 155 160 165 170 175 180 185 190 195 

l JVx £S? TlTTT GTAT[ TTMG TAGAC ^ GTTTC oCCgT GTTgg cCAGG> 

AD2-283 ACAAC GCCCA GCTAA TATTT CTATT TTTAG TAGAC ATGGG GTTTC TCCAT GTTCA SgG 



ADI6c-SP6 200 205 210 
[ 206 ] oTGcT -CGA tCTCC TGA> 
A02-283 CTGGT GTCGA ACTCC TGA 
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HREGCOC 3610 3615 3620 3625 3630 3635 3640 3645 3650 3655 3660 
[ 118 ] — -CC CC-oo gC-oG tCTlo oTcct GG-cT A-CT GlgTG AGcTG AccTC AogcA CflGCt> 
*D3-4 CC CCTCT ICTTC CGTGG GTTTG GGTAT ATTCT GGTTG AGATG ATATC ATTTA 0GGGG 



HREGCOC 3670 3675 3680 3685 3690 3695 3700 3705 3710 3715 3720 

I IN J GAA0G Cogog ooTcc A-Tcc oCC-T gTITC TgTTc TCCcT gCtTA gctcc AGGgo TgGAA> 

AD3-4 GAAGC CGCn TGTGA AGIAG GCCTT ATTTC ICTTG TCCTT TCGTA CAGGG AGCAT TTGAA 

HREGCOC 3725 3730 3735 3740 3745 3750 3755 3760 3765 3770 

1 US j cTgGg Aelgg GaT-o gAggo eaG-g TGAAG TG-cT GA-IT coggs oATgG oTG> 

AD3-4 GTAGT AGAAC GCTCT TACTC CGGTC TGAAC TCAGI CAOGT GGCTF TATCG TTC 

FIG. 18 



WW-5T7 5 15 20 25 30 35 40 45 50 

[ 90 J GATCC oAGCT ocGTA -CgcG TgcAT GCAcg ICoTo gcTcT TCTAT AGTGT CAO 

A03-4 221 GATCC GAGCT CGGTA CCAAG TTGAT GCATA CCTTG AGTAT TCTAT AGTGT CAC 

J8-4T7 155 150 145 140 135 130 125 120 115 110 . 105 

[ 362 J <gTATg GgCcc gATAg -c-l TAT-l TAgcC TTTAG AGCAC ACTGG CgGCC GTTAC TAGTG 

AD3-4 221 ATATA GACAA TATAA CAATA TATTG TATAC TTTAG AGCAC ACTGG CAGCC GTTAC TAGTG 

J8-4T7 100 95 90 85 80 75 70 65 60 55 50 45 

[ 362 ] <GATCC GAGCT CGGTA CCAAc TTGAT GCATA CCTTG AGTAT TCTAT AGTGT CACct -oAoT 

AD3-4 221 GATCC GAGCT CGGTA CCAAG TTGAT GCATA CCTTG AGTAT TCTAT AGTGT CACTA ATAGT 

FIG.18A 
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C2A-EPT7 
IM8] 
ADJ-4 SFF 



20 25 30 35 40 45 50 55 
-CTT AoTA- gAlAg ctoCT TA — AAAto AcTTA CcC-A cT— GTtTl> 
CTT ACIAC CAGAC AACCT TAGCC AAACC ATTTA CCCAA ATAAA GTATA 



G2A-EP 77 60 65 70 75 80 85 90 95 100 105 110 
J 148 J eGoC- T-G- cTTGA AAoCI otClg AlcAG AcATA GTAU GaAAc cAAlG A— At AcATT> 
AD3-4SRF GGCGA TAGAA ATTGA AA0CT GGCGC AATAG ATATA GTA0C GCAAG GAAAG ATGAA AAATT 

G2A-EP T7 115 120 125 130 135 
I 148 J AT-At oAAG- -TAA- A-gGo AAGGA -gAA> 
AD3-4SPF ATAAC CAAGC ATAAT ATAGC AAGGA CTAA 



55 50 45 40 35 30 25 20 15 10 
[ 182 J <AIA-T oCcCo ooccC CCCAA GoAcA gGGGa ggAoo oGAAA AAAAA AAAAA 
H REG GENE AGACT TCACT CTATC CCCAA GCATA CGGGT CTACT TGAAA AAAAA AAAAA 



H REG GENE 



1182] 



215 210 205 200 195 1901185 180 
-<CT ATAoA GgTcG TTC — TcooC goToA AGCAc GtGAc 
CT ATAGA GATTG TTCAT TTGCC TCTFA AGCAA GAGAT 
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f?-2T7 535 530 525 520 515 510 505 500 495 

i "J * McTo aWc l&l TGGTC TATcA lotto oAAcq 

AD4-4 T7F CAGAA AAATT GCCAA GCTAA ACTAA TGGTC TATAA AAGCG TAATT 

AD2-2T7 490 485 480 475 470 465 460 
I 110 J <TGCAT -T-o GoCAc AAA-T ACAaA oocCA TG— A -AocA oGCcA 
ADMT7F TGCAT GTGTG GGCAT AAACT ACAGA CCTCA TGCTA GAGTA TGCAA 



145 140 135 130 125 120 115 M0 

L\ J, L ' TTG 9 9 ToC " "^999 cccCc cCTA- gAggT cgAcG gTAT- cGA-T 

AD4 " 4 m AA GTTGA TTGCT TCAAA TTTCT GCTAC TAACT TCAAG CTATG GGAGT 

10 l 100 95 90 85 80 75 7 0 65 60 55 50 
!n \ \ L lGTAt cgAoT TccGG ** m 9 cTT - — TT flCTTT TCcTT Tcctg TGooA AgGIT 

A04-4 T7F TTGGC AGTAG TCACT TGAGG ATTTT TTTTC CAATT CGTTT TCATT TTTGT TGTTA AAGCT 

45 <0 35 30 25 20 15 10 
I 142 J <gGTtt TooAG TGAG- -AlAc AclTT ICC— GtogA ACoAg TGITC TAT 
AD4-4 T7F CGTAC TTCAG TGAGA CAGAA AAATT GCCAA GCTAA ACTAA TGGTC TAT 



FIG. 19 
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H REG GDC 5 10 15 20 25 30 35 40 45 

[ 278 J g AATTC CTGGG CIcAA CTGAT CCTCt coTgT coGtC TCCCA AAGTG> 

AD16C-T7-A A MTO CTGGG CTGAA GTGAT CCTCC AGTCT TGGCC TCCCA AAGTG 



SO 55 60 65 70 75 80 85 90 95 100 105 
CTGGG ATgAC AGGCl TGAGC cACco coCCo gcCCo TcATc AgTU TToTA tAooG -oooo> 
CTGGG ATTAC AGGCA TGAGC TACTG AGCCT AGCCT TAATG ATTAA TTTTA GAGTG ATGGC 



.. •«.« wjk no i u uv xa iju |3U | 3J jjjy 

I 278 J ooooA CCH A-oA Altgt TA-gG cooAl AolGA cAAoT TgtAo lo-lo tollc tlACo> 

AD16C-T7-A TTGTA CCTTC AAGCA ACATA TAGAG TTGAG ACAGA AAATT TCCAT CGTCC CGAGA AAACT 

H REG GENE 165 170 175 180 185 190 195 200 205 210 215 220 

[ 278 ] tTtCo -gATl ttlot UUT oooCT ColAA GAotl gotTA oTAAo TAoAA tTtoG IAT— > 

AD16C-T7-A GTGCT CCATG GGCCC CGTGT GCCCT CTGAA GATCG CCCTA TTAAC TATAA ATGGG CATTG 

H REG GENE 225 230 235 

[ 278 ] t-oA- -tcTG l-ctt TToA> 

A016C-T7-A CACAT GGTTG CCAGC TTCA 



H REG GENE 
[ 278 ] 
AD16C-T7-A 



FIG.20 
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AD16C 5 10 J5 20 

I 266 J ■ aACgt TtcGA aOCTo tCgTg> 

Hunon-PTP TACTC TGTGA GCCTG AOCTC 

AD16C 25 30 35 40 45 50 55 60 65 70 

[ 266 J AAG-c -ccCA TTUA GAgtT ootAc -ggGT -gC- ITcAA GggA- ocggg gCTaT -oo-> 
Hunon-PTP AAGCA CAGGA TTCCA GAAAT GGAAG GATGT GCCTT GTGAA GACAA CTTCT CCTTT GTCTG 

A016C 75 80 85 90 95 100 105 110115120 125 130 
( 266 J gAAGT -tltc lACgg GgoGC — oTG GAAAt TtlcT GTCTc oAloT GlgCl tGoAa qTACA> 
Hunan-P IP CAAGT TCAAA AACTA GAGGC AGCTG GAAAA TACAT GTCTA GAACT GATCC ACCAA TTACA 

AD16C 135 140 145 150 155 160 165 170 175 180 185 190 
[ 266 J_ ACcGt oTCtA AAATT AAtCo tt-oo ggCTo ggCtc ogtAg CTClg CCTGl -o-oT CcCoa> 
Human^ip ACGGA CTCAA AAATT AAACC GGACC ATCTC TCCAA CTCAA CTCAA CCTGG ACACT CTCTT 

AD16C 195 200 205 210 215 220 225 230 235 240 

I 266 J CoC-l tTcgG gogGC Cao— gAcTg gaggA TcocT TcAo- ccCflg gAo-t TTcoG AcqCO 

Hunon-PIP CICTG CTGAG TTTGC CTTGT TAATC TTCAA TAGTT TTACC TACCC CAGTC TTTGG AACCT 



205 

■VPCE Dr> 
VPCEDK 

RPTP AA 115 120 

[ 33 J SgSU LyKsW D> 

Tronslolio SSSLG IPKCWD 



AD16C-T7-A 
[33] - 
HPTRAA 
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AUGMENT OF A016C-SP6 cONA WITH AD2-2 SP6 cONA 



AD2-2SP6 40 45 50 55 60 65 70 75 80 85 90 

[ 362 ] CTIGC T-TG- -CCC AGGCT GgAGT GCAoT GGCoC AATCc IGGCT CACTG CoAcC TCCcO 
ADI6C-SP5- CTCGC TCTGT CACCC AGGCT GAAGT GCAGT GGCCC AATCT CGGCT CACTG CGAGC TCCAC 

AD2-2SP6 95 100 105 110 115 120 125 130 135 140 145 

[ 362 ] CTCCC GoGcT CAoga ocTTC TCCTG CCTCA —CCC TC-G TG AGccG CTGGG AITAO 

ADI6C-SP6- CTCCC GGGTT CACTT CATTC TCCTG CCTCA CTGCC TCAGC CTCTG AGTAG CTGGG ACTAC 

AD2-2SP6 150 155 160 165 170 175-180 185 

[ 362 ] AGGCG CgCgC CA-CA oG CgoC TAA-T oTTTG TATTT TT-GT AG> 

AD16C-SP6- AGGCG CCCAC CACCA CGTCC CCTGC TAATT TTTTG TATTT TTAGT AG 

AD2-283 50 55 60 65 70 75 80 85 90 95 100 105 
( 374 ] CCoTG TTcal CAGGc TGGTg TCGAo CTCCT GACCT CGTGA TCCGC CCGCC TcoGC CtCCO 
AD16C-SP6- CCGTG TTGGC CAGGA TGGTC TCGAT CTCCT GACCT CGTGA TCCGC CCGCC TTGGC CACCC 

AD2-2B3 110 115 120 125 130 

[ 374 ] AAAGt G-cTG GGATT ACAoG CGTGO 
AD16C-SP6- AAAGA GTTTG GGATT ACAGG CGTGC 
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AD3-4 AD2-2 AD3-4 




FIG.2IA FIG. 2 IB FIG.2IC 



AD3-4 CLONE 




CCCCACAACCCAAAAC 

#27 NORTHERN 
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G2-2Pst-*l13F SEQUENCE 

Sequence Range: 1 lo 251 

TGCAG CAATG GCAAC AACGT CTGCA AACTA TTAAC TGGCG AACTA CTTAC TCTAC CnCC 60 
OGGCA ACAAT TAATA CACTC GATOC ACCCC CATAA AGTTG CAGGA CCACT TCTGC GCTCG 120 
GCCCT TCOGG CTGGC TGGTT TATTC CTGAT AAATC TGGAG COGGT OGAGC GTGGG TCTCG 180 
OGTAT CATO GACCA CTGGG GCCAG ATGGT AAGGC CTCCG TATCG TACTT ATCTC ACAGC 240 
AGGGA GTCAG G 251 

FIG.22 



G2-2Pst-M13R SEQUENCE 

Sequence Range: 1 to 242 

TGCAG GAGCG GGGAG GCACG ATGGC CGCTT TGGTC CGGAT CTTTG TGAGG AACCT TACTT 60 
CTGTG CTGTG ACATA ATTGG ACAAA CTACC TACAG AGATT TAAAG CTCTA AGGAA ATATA 120 
AAATT TTTAA GTGTA TAATG TGTTA AACTA CTGAT TCTAA TTCTT TGTGT ATTTT AGATT 180 
CCAAC CCTAT CGAAC CTGAT GAATG GGAGC CAGTG GTGGA ATGCC TTTAA TGAGG AAACC 240 
TG 242 

FIG.22A 



G2-2PslKcoRUW3F SEQUENCE 

Sequence Range: 1 to 208 

TGCAG CAATC TTTCT TATAT ACATC CTTAA TAGAT AGCTA CTTAA AATAA CTTAC ACACC 60 
TTTTA GAGTT GCTTG AAAAC TATCT GATCA AGACA TAGTA ATTGA AACCA ATGAA TACAT 120 
TATAT AAAGT AAAGG AAAGG AGAAG AGAGG AAAGG GAGGG GAAGA GGAGA GGGAG GGACA 180 
AGCGA GAAAG GAAAG GGAAG GGAGA AAA onfl 



FIG.22B 
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Gen2-2Pst1-£coRHI13R SEQUENCE 
Sequence Ronge: 1 to 152 

CTCAC TAAAC OGATC AAGGA ATAAT TTTGA ATTTC AACTC TTACA TTTAA TAAAT ACATT 60 



G2-2PslKcoR1-T7 SEQUENCE 
Sequence Ronge: 1 to 338 

AATCT ATCTT ATATA CATGC TTAAT AGATA GCTAC TTAAA ATAAC TTACA CACGT TTTAG 60 
AGTTG CnGA AAACT ATCTC ATCAA GACAT AGTAA TTCAA ACCAA TGAAT ACATT ATATA 120 
AAGTA AAGGA AAGGA GAAGA GAGGA AAGGA GGGGA GAGGA GAGGA GCACA AGCGA GAAAA 180 



G5dPst-M13R SEQUENCE 

Sequence Ronge: 1 to 169 

TGCAG GAGTG GGGAG GCACG ATGGC CCCTT TGGIC CGGAT CTTTG TGAAG GAACC TTACT 60 
TCTGT GTGTG ACATA ATTGG ACAAA CTACC TACAG AGATT TAAAC GTCTA AGGTA AATAT 120 
AAAAT TTTTA GTGTA TAGGT TAAAC TACTG ATTCT AATGT TGTCT ATTT 169 




FIG.22C 



GGAAG GGAAC GGAGA AAAAG 
ATATA GTAAG TGTAC ATTTT 
GTACA TGTGC ATTTT ACTGC 



GGGGA AAGGG AGGTA GAGAG AGAGA GAAAA AGTGC TGGTC 240 
AACTT TTTAA GAAAC TACCC TACTC TATTC CAGAG TGATT 300 
ATTAT AGAGA TCATT nC m 
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C5d Pst-T71 SEQUENCE 

Sequence Ronge: I to 209 

OOCCC OCCTC CAGCA ATGGC AACAA CCTCT GCAAA CTATT AACTG GCGAA CTCAT TCATC 60 
TAGCT TCCOG GCAAC AATTA ATGAC TGGAT GGAGG CGGAT AAAGT TGCAG GACCA CTTCT 120 
CGCGT GGCCC TTCCG GCTGG CTGCT TTATT GCTGA TAATT GAGCC TGCGA GTGCC TCGCG 180 
TAICA TICGC GACAT GGGCC AGTAG GTAC 2 09 

FIG.22F 



G5dP$lKcoR1-SP SEQUENCE 

Sequence Ronge: 1 to 272 

CTTGC CCTTC ATGGA CTCAT ACAGC CGATC AGCAA AATGC AGGGC CTTGT TCTGA ATGCA 60 
CTGAA CCAGG TTCAG GAAAC CATTT TCCAG GTCTC CTTTA ACCIC TTTCC TGATG CTTTC 120 
CAACA TGTCA TAAGG GCTGT AACTC TTGTA CCTAT CAAAT ACTTT CTGGA GGTGG GGACA 180 

s s s s ss s r ™ c ** mcc ,crrc m » 

FIG.22G 



G5dPstl-EcoRl-t7 SEQUENCE 

Sequence Range: 1 to 278 

ACGGC CCAGC TTCCT TCAAA ATGTC TACTG HCAC GAAAT CCTCT CCAAG CTCAC CTTGG 60 
AGGGT GATCA CTCTA CACCC CCAAG TGCAT ATGGG TCTGT CAAAG CCTAT ACTAA CTTTC 120 
ATGCT GAGCG GGATG CTTTG AACAT TGAAA CACCC ATCAA GACCA AAGCT CTGGA TGAGG 180 
TCACC ATTGT CAACA TTTTG ACCAA CCGCA CCMT CACAC GAGAC AGGAT ATTGC CTTCG 240 
CCTAC CAGAG AAGGA CCAAA AAAGG AACTT GCATC ACA 278 

FIG.22H 
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AUGMENT Of G2-2Pst1 with HUMAN REG GENE (1) 



H REG GENE 3405 3410 3415 3420 3425 3430 3435 3440 3445 3450 3455 
[ 228 ] — AG CAATo GCAA- -AgGo ooGgA AAC-A oTAll ToGC- AA-gg tTTAl TCTtc CTTtq: 
G2-2PsHfl3F AG CAATC GCAAC AADGT CTGCA AACTA TTAAC TGGCG AACTA CTTAC TCTAG CTTOC 



H REG GENE 34653470 3475 3480 3485 3490 3495 3500 3505 3510 

f 228 J tGtCA gCAlT TctgA GlgTG cAcoc AGCCc cAgtg A-TTc CAtG- -lAtT TITGo G-T-G> 

C2-2Pst-UI3F CGGCA ACAAT TAATA GACTG GATGC AGCCC CATAA AGTTG CAGGA CCACT TCTGC GCTCG 



H REG GEN3515 3520 35253530 3535 3540 3550 3555 3563555 3575 3580 
[ 228 J oCCoc TgCcl CTGIC TGG-c ccTTc CccAT AgAoC cGccG CIGGT gGAGC GTCGC TCcCl> 
G2-2Pst-MJ3F GCCCT TCCOG CTGGC TGGTT TATTC CTGAT AAATC TGGAG CCGCT CGAGC GTGGG TCTCG 



H REG GENE 3585 3590 3595 3600 3605 3610 3615 3620 3625 3630 

I 228 J gGTcT CcToC oAGtc CTGGG G-CA- tTGG- -AGCC CcoaG cAHJ T-CTT A-oTC clgGO 

G2-2Pst-4il3F CGTAT CAnC GAG 0 CTGGG GCCAC ATGGT AAGCC CTCCC TATOG TAGTT ATCIC ACAGC 

EXDN5 45 50 55 60 65 70 75 80 85 90 95 

[ 124 ] gGTcT CcToC oAGlc CTGGG G-CA- tTGG- -AGCC CcooG cA-C T-GTT A-oTC ctgGO 

C2-2Pst-U13F CGTAT CATTC GAG Q CTGGG GCCAC ATGGT AAGCC CTCCG TATCG TAGTT ATCTC ACAGC 



H REG GENE3635 3640 
[ 228 ] ActGl CTqAO 
G2-2Pst-M13F AGGGA GTCAG 

EX0N5 100 105 

[ 124 ] ActGl GTgAG> 
G2-2PsHJo3F AGGGA GTCAG 



EXQN 
[124] 

G2-2Pst-N13F 



20 25 30 35 40 
-C TGGco CtGG- oG-co GTGGG TCcCt> 
C TGGAG CCGCT CGAGC GTGGG TCTCG 
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AUGMENT OF G2-2Pst wilh HUMAN REG GDC (2) 

H REG GENE 3155 3160 3165 3170 3175 3180 3185 3195 3200 

L ] lt > ""^ QtDoc llGlG GtA ~° A°olC tGCTg ctGTo CtCcT CoTT- TG-GG AAOCT TA-oT> 

C2-2PsHJI3R AG GAGCG GGGAG GCAOG ATGGC CGCTT TCGTC CGCAT CTTTC TGAGG AAOCT TACTT 

HREGGEN3210 3215 3220 3225 3230 3235 3240 3245 3250 3255 3260 3265 

L li oTocl oooTo mU A ~ Tot AlcAA CoACt oAlgG tcAgc cAAlG CToTo ctG-o ATATo> 
C2-2P S l-W13RCTGTG CTGTG ACATA ATTGG ACAAA CTACC TACAG AGATT TAAAG CTCTA UGM 3 

H REG GENE 3270 3275 3280 3285 3290 3295 3300 3305 3310 3315 3320 

iV™ J ,m,„ A "SI CcT_g GgccA OCfloA CoG 9° oCcAc TT-TT TooGT qoqoT ActTT> 
C2-2PsH«3RAAATT TTTAA GTGTA TAATG TGTTA AACTA CTCAT TCTAA TTGTT TGTCT ATTTT AGATT 

HREG CEN3J25 3330 3335 3340 3345 3350 3355 3360 3365 3370 3375 
I 194 J gggtc ICTgT -cAAo ITcAT oAcoc tlAtt tctTG GTGGA AToCo qTTAA TGAG> 
G2-2Psmj3RCCAAC CCTAT GGAAC CTGAT GAATG GGAGC CAGTG GTGGA ATGCC TTTAA TGAG 



G2-2P S l-« fl 3F 150 155 160 165 170 175 180 185 190 195 

nJ™ J I A ° AlCl GgoGC ^ " TdJA ^ C^TC lC 9cG ToTCo TICgA GcoCT> 

EX0N5 TA GAACC GCCGC TGGCA CTGGA GCAGT GGGTC CCTGG TCTCC TACAA GTCCT 

G2-2psl-MI3F 200 205 210 215 220 225 230 235 240 245 250 

ktH ] "2°° A ~ l " ccTcc 9 MTC 9 ToCl TA " T c TcocA GCogG gogTC AqG> 

EX0N5 GGGGC ATTGG ACCCC CAAGC AGTGT TAATC CTGGC TACTG TGTGA GCCTG ACCTC AAG 
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ALIGNMEXF OF G2-2PST-EC0R1-T7 WITH HUMAN REG GENE 



"REGGOC 335 340 345 350 355 360 365 

MJ2 ] — ; AT AGATA ICTAC TTtAl IcgAl TTAoA UCTG -TTTA> 

G2-2PSI-£C0R1 T7 AT AGATA GCTAC TTAAA ATAAC TTACA CACTG TTTTA 

H REG GENE 370 375 380 385 390 395 400 405 410 415 
f 132 J — GTo ITTIA UA-T ATlTt gUA- A-T- ccAtT TG-lc CCAAT tcATA tAcT- TAT> 
G2A-£P T7 GAGTG CTTGA AAACT ATCTG ATCAG ACATA GTAAT TGAAA CCAAT GAATA CATTA TAT 



f 1 ™ 740 735 

mct ■ ^ J!?!! 

ACATT TAAAT 

PL? r 730 725 720 7,5 7,0 705 7 <>° «* 690 685 
I BZ J <— Til ollTA -AAlg IgCA- ggGTT -ocA- ToCoo ToATq gAqoT AAAAA TAcco TAoon 
Gen2oEP-«o CATAA GGCTA TAACT ACCAT ACGTT CTCAT TTCTC TGATT AATTT AAAAA TAUT TMAA 



RATPTP 68675 670 665 660 655 

{ 82 ] <CgoGG cAAGA ATgTT — lg TCTAG GAAG 

C2-2PST-EC0R1-WI CCTGG AAAGA ATTTT ACCAT TCTAG GAAG 



FIG.23B 
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Hunan-PIP 270 260 285 290 295 300 305 310 315 

[ 166 ] AcCTG GtGct gTgCT cAC-C cACgc cgoGG glgC- cTTtg TgCcC T-cAc lGAtl> 

G5dPst-t AACTG GOGAA CTACT TACTC TAGCT TOOGG CAACA ATTAA TAGAC TGGAT GGAGG 

Hunan-PTP320 325 330 335 340 345 350 355 360 365 370 

[ 166 ] ooG-g AgAGT gGCAt GAtgA CTTCo olG-T CIG-g oTT- GGC— -CTcc oTgAc cc€oa> 

G5dPst-l CGGAT AAAGT TGCAG GACCA CTTCT GCGCT CGGCC CTTCC GGCTG GCTGG TTTAT TGCTG 

Hunon-PTP 375 380 385 395 400 405 410 415 420 425 430 435 

f 166 ] AoAgA oCcGc cGCiG GgGAG CCTGG GTCcC IgGTc TCcTo caAGt cCIGG GG-CA -ITGG> 

C5dPsl-t ATAAA TCTGG AGCCG GTGAG OGTGG CTCTC GCGTA TCATT GCAGC ACTGG GGCCA GATGG 

Hunon-PTP 440 445 450 455 460 465 470 

[ 166 ] — AGC CCcoo GcA— GTGtT cATCc tggCl ACtGl GtG-A -GcO 

G5dPsl-t TAAGC CCTCC GTATC GTGGT TATCT ACACG AOGGG GAGTA CGGC 



FIG.23C 
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V ™ ?° C 80 85 90 95 100 105 110 115 

knttL * ATco1 C °9 n T ~ T " lolAT 000 9° ««CC TTAoo> 
C5dPsl " U AC GCAOC ATGGC CGCTT TGCTC COGoT CTTTG TCAAG CAACC TTACT 

H EC GENE 120 125 130 135 140 145 150 155 160 165 170 175 

JL!7.'.. £1? lflfi9C MSK cTolG X*** ~ J W °AUt AlATI cttAC oTtTc AGoT- -tUT> 
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